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(57) Abstract 



A decoder decodes directionally encoded audio signals for reproduction via a loudspeaker layout over a listening area. The 
encoded signals are received at an input and decoded by matrix means. The signals are output in a form suitable for reproduction 
via the loudspeaker layout. The coefficients of the decoding matrix are such that at a predetermined listening position in the lis- 
tening area, the reproduced velocity vector direction and the reproduced energy vector position directions are substantially equal 
to each other and substantially independent of frequency in a broad audio frequency range. The gain coefficients of the decoding 
matrix are such that the reproduced velocity vector magnitude r v of the decoded audio signal varies systematically with en- 
coded sound direction at frequencies in the region of and above a predetermined middle audio frequency. In a preferred ex- 
ample, the decoder is an ambisonic decoder for handling, for example, B-format encoded signals. The decoder may be used 
in an audio- visual system, such as HDTV. 
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1 

SURROUND ROUND APPARATUS 

FIELD OF INVENTION 

The present invention relates to techniques for 
5 directionally encoding and reproducing sound, and 
particularly, but not exclusively, to the technique known 
as surround sound and to the provision of improved surround 
sound decoders and reproduction systems using such 
decoders . 

10 The present invention is applicable to a number of 

different surround sound techniques, including Ambisonics, 
and to other encoding techniques. 

BACKGROUND TO THE INVENTION 

15 Ambisonics was developed in the 1970 's and early 

1980' s based on the idea of encoding information about a 
360° directional surround-sound field within a limited 
number of recording or transmission channels, and decoding 
these through a frequency-dependent psychoacoustically 

20 optimised decoder matrix. The matrix is adapted to the 
specific arrangement of loudspeakers in the listening room, 
so as to recreate through that specific layout the 
directional effect originally intended. Examples of 
Ambisonic systems are described and claimed in the earlier 

25 British patents numbers 1494751, 1494752, 1550627 and 
2073556, all assigned to National Research Development 
Corporation, Ambisonic techniques are also described in a 
number of published papers including the paper by M.A. 
Gerzon, "Ambisonics in Multichannel Broadcasting and Video" 

30 published at pp 859-871 of J Audio Eng. Soc. , Vol. 33, No. 
11, (1985 Nov.). 

While known Ambisonic systems have worked extremely 
well, particularly in those cases where at least three 
transmission channels are available, they do nonetheless, 

35 in common with many other sound reproduction systems, 
suffer some limitations particularly with respect to the 
stability of front-stage images. This is a marked 
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disadvantage in particular when it is desired to use 
Aabisonics in an audiovisual system with TV, film or HDTV, 
men it is found that the stability of front-stage images 
is not good enough to give a reasonable match in direction 
5 between the audible and visual image across the whole 
listening area. 

STTMM&RY OF ™* TNVENTION 

According to a first aspect of the present invention, 
10 there is provided a decoder for decoding directionally 
encoded audio signals for reproduction via a loudspeaker 
layout over a listening area, comprising: 

an input for receiving the directionally encoded audio 

signals; . 
15 matrix means for modifying said audio signals; and 

an output for outputting the modified audio signal in 
a form suitable for reproduction via the loudspeakers; 

.the coefficients of said matrix means being such that 
at a predetermined listening position in the listening area 
20 the reproduced velocity vector direction and the reproduced 
energy vector directions are substantially egual to each 
other and substantially independent of frequency in a broad 

audio frequency range, 

characterised in that the gain coefficients of said 
25 matrix means are such that the reproduced velocity vector 
magnitude r v of a decoded audio signal varies 
systematically with encoded sound direction at *^«~~ 
in the region of and above a predetermined middle audio 

30 According to another aspect of the present invention, 
there is provided a surround sound decoder including matrix 
decoding means for decoding a signal having pressure- 
related and velocity-related components thereby providing 
output signals representing feed signals for a plurality of 

35 loudspeakers, in which the values of the coefficients of 
the matrix decoding means are such that the magnitude r v of 
the real narfof the ratio of the reproduced velocity 
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vector gain to the reproduced pressure gain varies with 
azimuthal direction at at least some frequencies. 

Preferably the decoder is an Ambisonic decoder. 
The velocity vector having magnitude r v , energy vector 
5 having magnitude r E and pressure signal P are formally 
defined and discussed in relation to the Ambisonic decoding 
equations in the detailed description and theoretical 
analysis below. 

In all the prior art Ambisonic decoders, a decoding 

10 matrix has been used which is frequency-dependent so as to 
ensure that r v equals 1 at low frequencies and that r E was 
larger at high frequencies. In all such matrices the 
velocity vector had a magnitude which was substantially 
constant for all directions and the pressure signal P had 

15 exactly the same directional gain pattern (as a function of 
encoded azimuth $) at low and high frequencies, apart from 
a simple adjustment of overall gain with frequency. The 
present invention, by contrast, provides an Ambisonic 
decoder arranged to satisfy the Ambisonic decoding 

20 equations in the case where the r v varies with azimuth, 
and, preferably, the directional gain pattern of the 
pressure signal P varies with frequency. Typically, for 
decoders having better front-stage than back-stage image 
stability, the back-sound gain divided by front-sound gain 

25 for the pressure signal will have a smaller value at low 
frequencies (for which r v typically equals 1) than at 
higher frequencies (for which typically r E is maximised 
with a greater value for front- stage sounds than for back- 
stage sounds) . 

30 The present inventors have recognised that the 

directional gain pattern of the pressure signal P (which 
for layouts of speakers at identical distances is the sum 
of the speaker feed signals) can be varied with frequency 
while still giving solutions of the Ambisonic decoding 

35 equations and that this gives an extra degree of freedom 
which may be used to optimise the performance of the 
Ambisonic decoder. In particular, it is found to be 



WO 93/18630 



PCT/GB93/00429 



advantageous to maRe r v vary substantially with encoding 
azimuth 8 rather than to be substantially constant with 
azimuth as it was in the prior art Ambisonic 
This is particularly important in improving the stability 
5 of images. It is found that the degree of image movement 
with lateral movement of the listener is proportional to 1- 
r„ so that the greater the value of r E the less the 
Movement and hence the greater the image stability. It is 
also found that the value of r E tends to be maximised when 
a0 r E substantially equals r v . r E varies with the encoding 
azimuth and so it is found that the best performance is 
obtained by having r v also vary with encoding azimuth so as 
to track the value of r E in so far as this is possible. In 
general r E varies with direction with higher harmonic 
15 components than r v so that only rarely can r E and r v have 
exactly the same values for all azimuths. Nonetheless it 
is found that fairly close matching of the two quantities 
generally gives improved high frequency results. 

Preferably the encoded directional signal is modified 
20 to increase the relative gains of sounds in those 
directions in which the magnitude r E of. the reproduced 
energy vector is largest. 

A further property of Ambisonic decoder systems which 
hitherto has tended to degrade image stability, is the fact 
25 that overall reproduced energy gain E tends to be largest 
when r E is smallest and vice-versa. It is therefore 
advantageous if as well as maximising r E in a desired 
direction in accordance with the first aspect of this 
invention, the gain is modified in a complementary fashion 
30 to counter the loss in energy gain which would otherwise 
occur. in general, such modifications will alter the 
reproduced azimuth so that it no longer equals the encoded 
azimuth 9 but such modifications in practice will not be so 
large as to introduce gross directional distortions m the 
35 reproduced sound image. Moreover, it is found that even if 
the reproduced azimuth does not exactly equal the encoded 
azimuth nonetheless frequency-dependent image smearing can 
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be avoided as long as the decoded azimuth does not vary 
substantially with frequency (at least up to around 3% 
kHz) . 

According to a second aspect of the present invention, 
5 an Ambisonic decoder including decoder matrix means 
arranged to decode a signal having components W, X, Y where 
W is a pressure-related component and X and Y are velocity- 
related components, as herein defined, the matrix decoding 
means producing thereby output signals representing 

10 loudspeaker feeds for a plurality of loudspeakers, further 
comprises transformation means arranged to apply a 
transformation to the signal components W, X, Y thereby 
generating transformed components W , X', Y' for decoding 
by the decoding matrix means. 

15 The transformation may be a Lorentz transformation. 

As described in greater detail below, the sound field 
is preferably encoded using the so called B-format. B- 
format encodes horizontal sounds into three signals, W, X 
and Y where W is an omnidirectional signal encoding sounds 

20 from all azimuths with equal gains equal to 1 and X and Y 
correspond to figure-of-eight polar diagrams with maximum 
gains V2 aligned with the orthogonal X and Y axes 
respectively. This format is shown in Figure 2. These 
signals have the property that for sound from any given 

25 direction 6 the relationship 

2W 2 = X 2 + Y 2 ( 2 ) 

applies. The present inventors have recognised that by 
applying to the original encoded W, X and Y signals a 
transformation of the class known as Lorentz 

30 transformations, signals W, X', Y' result which still 
satisfy the relationship given above. This enables 
manipulation of the signal while retaining the Ambisonic 
properties. B-format can also be extended to full-sphere 
directional signal by adding a fourth upward-pointing 2 

35 figure-of-eight signal as shown in Figure 3. Although for 
clarity this aspect of the invention is described in 
relation to encoding in the horizontal plane it also 
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encompasses such full-sphere W.X.T.I encoding. Similarly 
the other aspects of the invention are also applicable to 

full-sphere encoding- 

Preferably the Lorentz transformation means are 
5 arranged to apply a forward dominance transformation. 

one particular Lorentz transformation termed by the 
inventor the "forward dominance- transformation, and 
defined in detail below, has the effect of increasxng front 
sound gain by a factor X while altering the rear sound gain 
xo by an inverse factor 1/X. A forward dominance 
transformation is particularly valuable with decoders in 
accordance with the first aspect of the invention, since as 
noted above such decoders can otherwise give excessive 
gains for rear sounds. 
15 According to a third aspect of the present invention, 

in a surround sound decoder include decoder matrix means 
arranged to decode a signal and provide output signals 
corresponding to a plurality of loudspeaker feed signals 
the decoder is arranged to output signals representing feed 
20 signals for an arrangement of loudspeakers surrounding . 
listener position comprising at least two pairs of 
loudspeakers disposed symmetrically to the two sides of the 
listener position and for at least one further loudspeaker 
positioned between one of the said pairs of loudspeakers. 
25 Preferably the decoder is an Ambisonic decoder. 

Preferably the at least one further loudspeaker is 
positioned centrally between the two front-stage 
loudspeakers . 

it has long been known that the use of an additional 
30 central loudspeaker to supplement the two ^ P ^ersJor 
the front stage can enhance considerably the stability of 
front-stage images. In view of this, proposed standards 
for HDTV sound invariably incorporate at least one such 
central' loudspeaker in addition to a stereo pair. However 
35 hitherto it has only been possible to find Ambisonic 
decoder solutions for highly symmetrical, e.g. rectangular 
or hexagonal, layouts. The present inventor however have 
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found a solution for decoders suitable for a less symmetric 
layout incorporating one or more central loudspeakers. 
There are listed in further detail below solutions for 
different 5 and 6 speaker layouts together with a general 
5 procedure for finding other such solutions. 

According to a further aspect of the present 
invention, there is provided an Ambisonic decoder including 
matrix decoding means for decoding a signal having 
pressure-related and velocity-related components and 
10 thereby providing output signals representing feed signals 
for a plurality of loudspeakers, in which the values of the 
coefficients of the matrix decoding means are such that the 
directional gain pattern of the pressure-related component 
P of the reproduced signal is different at different 
15 frequencies, the decoder not being a 2.5 channel Ambisonic 
decoder responsive to 3-channel surround sound at some 
audio frequencies and to a 2-channel surround sound at some 
other frequencies. 

According to a further aspect of the present invention 
20 there is provided a decoder including an Ambisonic decoder 
according to any one of the preceding aspects, and further 
comprising means for decoding a supplementary channel E, 
thereby providing improved stability of and separation 
between the front and rear stages. 
25 According to a further aspect of the present invention 

there is provided a decoder including an Ambisonic decoder 
according to any one of the preceding aspects, and further 
comprising means for decoding a supplementary channel F 
thereby providing a further output signal for use in 
3 0 cancelling crosstalk between front and rear stages. 

Preferably E is encoded with gain k e ( l-c e ( 1-cos $)) for 
I $\ <9 S and gain 0 for i $\ >d s and F encoded with gains 2* 
k f sin 9 for I d\ < 6 S , gains -2* k b sin 0 for I l80--d < * B and 
gain 0 for other 0, where 6 S is the half -width of a frontal 
35 stage which may typically be 60% 0 S is the half-width of 
a rear stage which may typically be 70% and k e , k f and k b 
are gain constants which may be chosen between 0 (for pure 
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B-format) and a value equal to or in the neighbourhood of 
1 (for reproduction effect purely in front and rear 
stages). Preferably c e lies between substantially 3 and 
3 5 and more preferably is equal to substantially 3.25. 
5 This aspect of the present invention provides a 

decoder which gives a further improvement in frontal-stage 
image stability. While the Ambisonic decoders of the 
preceding aspects of the present invention in themselves 
give a significant improvement in stability even greater 

10 stability may still be desirable when, for example, the 
decoder is to be used in an HDTV system. The present 
inventor has found that by adding at least one additional 
channel signal E to provide a feed for a centre-front 
loudspeaker, a system results which offers much of the 

15 image stability attainable with the three or four-speaker 
frontal stereo systems known in the art, while retaining 
the flexibility of the Ambisonic decoders in providing 
optimally decoded results via a variety of speaker layouts. 
Moreover, it is found that the 5 or 6-speaker Ambisonic 

20 decoders of the present invention provide a far better 
basis for such a system enhanced by a supplementary channel 
than do the conventional 4-speaker Ambisonic decoders known 
in the art. 

in the preferred implementation of this aspect of the 
25 invention, two supplementary channels E, F are used in 
addition to channels W, X, Y to provide a format termed by 
the inventor "enhanced B-formaf. This enhanced format xs 
described fully in the detailed description below. 

The present invention in its different aspects is 
30 applicable to audio signals that are directionally encoded. 
Directionality of sounds can be encoded in various ways 
using two or more related audio signal channels. In all of 
these methods, each encoded direction of sound is mixed 
into the audio signal channels with gains (which may be 
35 real or complex, and may be independent of or dependent on 
frequency) on each signal channel whose values as a 
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function of direction characterise the directional encoding 
being used. 

An overall real or complex gain change applied equally 
to all audio channels does not change the directional 
5 encoding of a sound, but only the gain and phase response 
of the sound itself. Thus, directional encoding is 
characterised by the relative gains with which a sound is 
mixed into the audio signal channels as a function of 
intended direction. 

10 Examples of directional encoding include the familiar 

case of conventional amplitude stereophony in which the 
direction of sounds in two stereo channels is encoded by 
the relative amplitude gain in two channels normally 
intended for reproduction via respective left and right 

15 loudspeakers. The means of encoding gains as a function of 
direction often used in studio mixing is the device known 
as a stereo panpot which allows alteration of the encoded 
stereo direction by giving adjustment of the relative gains 
of a sound in the left and right channels. An alternative 

20 means of encoding directionally often used is a coincident 
stereo microphone recording array, where coincident 
directional microphones pointing in different directions 
are used, and sounds recorded in different directions 
around such a microphone array will be encoded into the two 

25 channels with different gains determined by the gain 
response of the two microphones for that incident sound 
direction. 

Conventional two channel stereo is only the simplest 
of many directional encoding methods known in the prior 

30 art. The invention is particularly applicable to methods 
of surround sound decoding cover a 360° sound stage of 
directions, such as the methods known as B-format or UHJ 
described in M.A. Gerzon, "Ambisonics in Multichannel 
Broadcasting and Video", J. Audio Eng. Soc. , Vol. 33, No. 

35 11, pp. 859-871 (1985, Nov.) or to other prior art methods 
such as BMX directional encoding described later in this 
description. 
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10 



10 

These preferred methods of surround sound directional 
encoding with which the invention may be used encode 
horizontal directions as linear combinations of three 
signals, W with constant gain 1 as a function of direction, 
and directional signals X and Y whose gains as a function 
of encoded direction follow a figure-of-eight or cosine 
gain law pointing in two orthogonal directions. For 
example, in B-format, X may be chosen to have a gain V2 
cosfl and Y a gain V 2 sinfl, where 8 is the angle of a 
direction measured anticlockwise from due front in the 
horizontal plane. The directionality for B-format can be 
encoded either by a suitable B-format panpot such as has 
been described by M.A. Gerzon & G.J. Barton, "Ambisonic 
Surround Sound Mixing for Multritrack Studios", Conference 
15 Paper C1009 of the 2nd Audio Engineering Society 
international Conference, Anaheim (1984 May 11-14), or by 
means of a sound field microphone giving a B-format encoded 
output in response to incident sounds. 

Alternatively, any three independent linear 
combinations of the signals W, X and Y may be used to 
encode directionality, since B-format signals may be 
recovered from these by using a suitable inverse 3 x 3 
matrix. Any decoder for reproducing B-format may be 
converted to one for three such linear combination signals 
by preceding or combining the B-format decoder with an 
appropriate 3 x 3 matrix having the effect of recovering B- 

format signals. 

360» directionality may also be encoded into just two 
channels as complex linear combinations of W, X and Y. For 
example, in the prior art in one of the inventors' British 
Patent 1550628, systems encoding direction are considered 
that use two independent linear combinations of channels 
whose gains as a function of directional angle 6 are 
2 m a + bcosfl + jcsinfl 
A = je + jfcosfl + gsinfl 
where the coefficients a, b, c d, e, f, g are real gain 
coefficients and where j = V(-l) represents a relative 
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11 

broadband 90° phase difference, which may be implemented by 
known 90° phase difference all-pass phase shift networks. 
As is well known in the prior art, such directionally 
encoded 2-channel signals may be derived from B-format 
5 signals by means of a phase -amplitude matrix incorporating 
90° phase difference networks. 

The invention is also applicable to the decoding of a 
broad range of such 2-channel directionally encoded sound 
signal channels, including the prior art BMX and UHJ 
10 systems, which are of this form. It is also applicable to 
conventional 2-channel amplitude stereophony encoding, 
since this directional encoding may be converted to a BMX 
encoding by inserting a 90« phase difference between the 
two channels. 

15 The invention may also be applied to signals in which 

additional to a 360° azimuthal encoding, directions in 
three dimensional space are encoded including for example 
elevated sounds, such as are described in M.A. Gerzon, 
"Periphony: With-Height Sound Reproduction", J. Audio Eng. 

20 Soc, Vol. 21, pp. 2-10 (1973, Jan. /Feb.) and in the cited 
1985 Gerzon reference. Additional directionally encoded 
channels may be added, such as a signal 2 with vertical 
figure-of-eight directional gain characteristic, or the 
directional enhancement signals E and F described elsewhere 

25 in this description. 

The invention is additionally applicable to other 
systems of directional encoding having substantially the 
same relative gains between signal channels as the systems 
described above, even if their overall or absolute gains or 

30 phases as a function of encoded direction varies. 

As already noted, the decoders of the present 
invention while being generally applicable have particular 
advantages when used with audiovisual systems. The present 
invention also encompasses TV, HDTV, film or other 

35 audiovisual systems incorporating a decoder in accordance 
with any one of the preceding aspects in its sound 
reproduction stages. 
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Decoders according to the invention may be implemented 
using any known signal processing technology in ways 
evident to those skilled in the art, and in particular 
either using electrical analogue or digital signal 
processing technology, or a combination of the two. 

in the electrical analogue case, matrix networks or 
circuits may be implemented using resistors or voltage or 
digitally-controlled active gain elements to implement 
matrix gain coefficients in combination with active mixing 
devices such as operational amplifiers to perform addition 
or subtraction of signals. Frequency-dependent elements 
such as cross-over network filters may be implemented using 
any familiar active filter topology. Good approximations 
to relative 90- phase difference networks may be 
implemented by pairs of all-pass networks each comprising 
cascaded first order all-pass poles of the kind extensively 
described in the previous literature on, for example, 
quadraphonic or surround-sound phase-amplitude matrixing or 
on single sideband modulation using quadrature filters. 

in the case where it is preferred to use digital 
signal processing to implement decoders, analogue-to- 
digital converters may be used to provide signals in the 
sampled digital domain, and the decoders may be implemented 
as signal processing algorithms on digital signal 
processing chips. In this case, filters may be implemented 
using digital filtering algorithms familiar to those 
skilled in the art, and matrices may be implemented by 
multiplying digital signal words by gain coefficient 
constants and summing the results, -me digital outputs may 
be converted back to electrical analogue form by using 
digital-to-analogue converters. 

It will be understood by those skilled in the art that 
matrix, gain and filter means in decoders may be combined, 
rearranged and split apart in many ways without affecting 
the overall matrix behaviour, and the invention is not 
confined to the specific arrangements of matrix means 
described in explicit examples, but includes, for example, 
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13 

all functionally equivalent means such as would be evident 
to one skilled in the art. 

The outputs of decoders will typically be fed to 
loudspeakers using intermediary amplifier and signal 
5 transmission stages which may incorporate overall gain or 
equalization adjustments affecting all signal paths 
equally, and also any gain, time delay or equalization 
adjustment that may be found necessary or desirable to 
compensate for the differences in the characteristics of 
10 different loudspeakers in the loudspeaker layout or for the 
differences in reproduction from the loudspeakers caused by 
the characteristics of the acoustical environment in which 
the loudspeakers are placed. For example, if the 
reproduction from one loudspeaker is found to be deficient 
15 in a given frequency band relative to the reproduction from 
the other loudspeakers, a compensating boost equalisation 
may be applied in that frequency band to feed that 
loudspeaker without changing the functional performance of 
the decoder according to the invention. 
20 Especially, but not only in public address 

applications of the invention, decoders may be fed to 
loudspeaker layouts using loudspeakers covering only a 
portion of the audio frequency range, and different 
loudspeaker layouts may be used for different portions of 
25 the audio frequency range. In this case, the directionally 
encoded audio signals may be fed to different decoder 
algorithms for loudspeaker layouts used in different 
frequency ranges by means of cross-over networks. 

As with prior art Ambisonic decoders, it is a 
30 characteristic of decoders according to the invention that 
for any given directional encoding specification, the 
matrix algorithms used to derive signals suitable for 
feeding to loudspeakers depends on the loudspeaker layout 
used with the decoder, as will be described in more detail 
35 below and in the appendices. 

It is therefore desirable that the decoder should 
incorporate or be used with means of adjusting the decoder 
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matrix coefficients in accordance with the loudspeaker 
layout it is intended to use with the decoder, so that 
correct directional decoded results may be obtained. For 
example, as disclosed in one of the inventors British 
Patent 1494751 filed 1974, March 26, prior art Ambisonic 
decoders for rectangular loudspeaker layouts incorporated 
gains in two velocity signal paths in decoders as a means 
of adjusting for different shapes of rectangle, and in 
commercially available decoders this is implemented by 
means of a potentiometer adjusting the gain of two velocity 
signal paths whose settings are calibrated either with 
pictures of the layout shape or with the ratio of the two 
sides of the rectangle. In a similar way, one of the 
inventors British Patent 2073556 filed 1980 discloses the 
15 provision of gain adjustments in velocity signal paths in 
decoders for certain loudspeaker layouts where loudspeakers 
are disposed in diametrically opposite pairs. 

In general, loudspeaker layout control means may 
constitute a number of adjustable matrix coefficients in 
20 the decoder linked to a means of adjusting these in 
accordance with an intended or actual reproduction 
loudspeaker layout. The adjustment means may constitute 
potentiometers or digitally or voltage controlled gain 
elements in analogue implementations or a means of 
25 computing or looking up in a table the matrix coefficients 
in a digital signal processor, and a means incorporating 
these coefficients in a signal processing matrix algorithm. 

The method of adjustment may be in response to a 
control menu specifying the shape of the loudspeaker 
30 layout, or one or more controls adjusting analogue 
parameters defining the loudspeaker layout shape, or- a 
combination of these, or any other well known means of 
adjusting parameters in signal processing systems. The 
loudspeaker layout may be determined by geometrical 
35 measurements, for example with a measurement tape, or by 
any known automatic or semi-automated measurement technique 
such as those used to determine distance in autofocus 
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cameras. In the automated case, the results of the 
measurements may be used to compute appropriate matrix 
coefficients, for example by interpolation between the pre- 
computed values of matrix coefficients on a discrete range 
5 of loudspeaker layouts computed by the methods indicated in 
the appendices. 

In the prior Ambisonic art, the layout control 
adjustment of matrix coefficients has the effect of 
altering only signals represented reproduced velocity, but 

10 not signals representing the reproduced pressure. In 
contrast , for many loudspeaker layouts to which the present 
invention is applicable, including those with more 
loudspeakers disposed across a frontal stage than across a 
diametrically opposed rear stage, the layout control 

15 adjustment of matrix coefficients has the effect of 
altering not only signals representing reproduced velocity, 
but also signals representing the reproduced pressure as 
well, as may be seen by computing the pressure signal 
(which is the sum of the loudspeaker output signals) for 

20 various loudspeaker layouts disclosed in the appendices. 

The invention may be used in conjunction with the 
methods disclosed in British Patent 1552478 to compensate 
for different loudspeaker distances from a preferred 
listening position in the listening area. The decoding 

25 matrices of the present invention may be combined with time 
delays and gain adjustments for the output loudspeaker feed 
signals that compensate for the altered time delay and 
gains of sounds arriving at the preferred listening 
position caused by unequal loudspeaker distances from the 

30 preferred listening position. 

BRTEF DESCRIPTION OF THE DRAWINGS 

The present invention will now be described in further 
detail with reference to the accompanying drawings, in 
35 which: 

Figure 1 is a diagram illustrating an encoding/ 
reproduction system; 
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Figure 2 is a diagram illustrating the coordinate 
conventions ; 

Figure 3 is a polar response diagram for horizontal B 
format ; 

5 Figure 4 is a polar response diagram for full sphere 

B format; 

Figure 5 is a diagram illustrating a forward dominance 

transformation; 

Figures 6 and 7 are diagrams showing alternative 5- 

10 speaker layouts; 

Figures 8 and 9 are diagrams showing alternative 6- 

speaker layouts; 

Figure 10 is a diagram showing the architecture of a 
B-f ormat Ambisonic decoder for the layouts of Figures 6 to 
15 9; 

Figure 11 shows the architecture of a decoder for an 
enhanced Ambisonic signal incorporating E & F channels; 
Figure 12 shows a rectangular speaker layout; 
Figure 13 shows a 2-channel Ambisonic decoder for the 
20 layout of Figure 12; and 

Figure 14 shows an example of a 2-channel Ambisonic 

decoder • 



25 



30 



35 



DETAILED nRSCRIPTIO " ™ ™RnnTMENTS 

Figure 1 is a block diagram illustrating a typical 
ambisonic encoding/reproduction chain. An incident sound 
field is encoded by a SoundField microphone 1 into B-Format 
signals- The resulting WXY signals are applied to an 
ambisonic decoder. The ambisonic decoder 2 applies to the 
WXY signals a decoding matrix which derives output signals 
from weighted linear combinations of the W X and Y signals. 
These output signals are then amplified by amplifiers 3 and 
supplied to speakers 4 arranged in a predetermined format 

around a listener 5. 

The sound field microphone 1 is a one-microphone 
system such as that currently commercially available from 
AMS as the Mark IV SoundField microphone. 
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Figure 3 shows the horizontal polar diagrams of the B- 
format signals. As noted above, B-format can be extended 
to include four full-sphere directional signals as shown in 
Figure 4. 

As an alternative to direct encoding of sound using a 
SoundField microphone, it is also possible to produce 
horizontal B-format signals using a B-format panpot which 
feeds an input signal directly to the W output with gain l, 
and uses a 360° sine/cosine panpot with an additional gain 
2* to feed the respective Y and X outputs. This is 
described in the paper by the present inventors "Ambisonic 
Surround-Sound Mixing for Multi Track Studios" Conference 
Paper C1009, 2nd AES International Conference "The Art and 
Technology of Recording", Anaheim, California (1984 May Il- 
ls 14) . 

The ambisonic decoding equation used to derive the 
loudspeaker feed signal from the WXY signals is determined 
for a given loudspeaker layout in accordance with certain 
psychoacoustic criteria formally represented by the so- 

20 called Ambisonic equations. As described in further detail 
below, these define different constraints appropriate to 
different frequency bands for the reproduced sound in terms 
of the gnp-rgv vector and velocity, vector together with the 
scalar pressure signal P. 

25 The localisation given by signals emerging with 

different gains g, from different loudspeakers around a 
listener can be related to physical quantities measured at 
the listener location. In particular, it can be shown that 
localisation given at low frequencies by interaural phase 

30 localisation theories below about 700 Hz is determined by 
the vector given by dividing the overall acoustical vector 
velocity gain of a reproduced sound at the listener by the 
acoustical pressure gain at the listener. In the case of 
complex signals, the real part of this vector is used. The 

35 resulting vector, for natural sound sources, has length one 
and points at the direction of the sound source. For 
sounds reproduced from several loudspeakers, the length r v 
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of this vector should ideally be as close to 1 as possible, 
especially for sounds intended to be near azimuths ±90«, 
and the azimuth direction tf v of this vector is an 
indication of the apparent sound direction. 

Between about 700 Hz and about 4 kHz (and these 
figures are merely rather fuzzy indications) , and also for 
non-central listeners hearing mutually phase-incoherent 
sound arrivals from different speakers below 700 Hz, 
localisation is determined by that vector which is the 
ratio of the vector sound- intensity gain to the acoustical 
energy gain of a reproduced sound. Again, for natural 
sound sources, this vector would have length one and point 
to the sound source. For reproduced sounds, the length r E 
of this vector should be as close to one as possible (it 
can never exceed 1) for maximum stability of the image 
under listener movement, and its direction azimuth 0 E is an 
indication of the apparent direction of the image. 

These vector quantities can be computed from a 
knowledge of the gains g, with which a sound source is fed 
to each of the loudspeakers, as follows. Suppose one has 
n loudspeakers all at equal distances . from the listening 
position; let the i'th loudspeaker be at azimuth 6, and 
reproduce a sound with gain g,. (While the theory can be 
developed for complex gains g,. we here assume that g, is 
real for simplicity) . The acoustical pressure gain is then 
simply the sum 

(5) 

of the individual speaker gains. The velocity gain is the 
vector sum of the n vectors with respective lengths g, 
pointing towards azimuth 9, (i.e. towards the associated 
loudspeaker) , which has respective x- and y-components 

(6x) 

and 

(ey) 

n 
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By dividing this velocity gain vector by the pressure gain 
p, one obtains a velocity i realisation vector of length r v > 
0 pointing in direction azimuth 0 V , where 

r v cos0 v = V x /P (7x) 
5 r v sin0 v = V Y /P . (7y) 
0 v is termed the velocity vector localisation azimuth, or 
Makita localisation azimuth, and is the apparent direction 
of a sound at low frequencies if one turns one's head to 
face the apparent direction. r v is termed the velocity 
10 vector magnitude and ideally equals one for single natural 
sound sources. The two quantities 0 V and r v are indicative 
of apparent localisation direction and quality according to 
low-frequency interaural phase localisation theories, with 
deviations of r v from its ideal value of one indicative of 
15 image instability under head rotations, and poor imaging 
quality particularly to the two sides of a listener. 

A similar procedure is used according to energy 
theories of localisation, but with the square | gi | 2 of the 
absolute value of the gain from each speaker replacing the 
gain g { . The overall reproduced energy gain is 



(8) 

and the sound-intensity gain is the vector sum of those 
vectors pointing to the i'th speaker with length |g,| , 
which has x- and y- components 

<9x) 

and r. 

in (9y) 
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By dividing this sound-intensity gain vector by the overall 
energy gain E, one obtains an energy. Realisation Y£c£2E of 
length r e z0 pointing towards the direction azimuth «„ where 
r E cos* E = E x /E <j£> 

5 r E sinfl E = V E * * . 

0 is termed the energy vector lasaligafeiea azimuth,, and is 
broadly indicative of the apparent localisation direction 
either between 700 Hz and around 4 kHz, or at lower 
frequencies in the case that the sounds arrive in a 

10 mutually incoherent fashion at the listener from the n 
loudspeakers . 

r is termed the energy, vector magnitude of the 
localisation, and is indicative of the stability of 
localisation of images either in the frequency range 700 Hz 
15 to 4 kHz or at lower frequencies under conditions of phase- 
incoherence of sound arrivals. As before with r v , the 
ideal value for a single sound source is equal to 1. 
Because r E is the average (with positive coefficients 
|g | 2 /(E|g| 2 )) of n vectors of length 1, it is only equal 
20 to' 1 if all sound comes from a single speaker. Generally 
r E is less than 1, and the quantity l-r E is roughly 
proportional to the degree of image movement as a listener 
uoves his/her head. Ideally, for on-screen sounds with 
HDTV, one would like l-r E < 0.02, but one finds that 
25 typically for central stereo images with 2-speaker stereo 
that l-r E = 0.134, and for surround-sound systems that l-r E 
lies between 0.25 and 0.5. 

For frontal stage stereo systems subtending relatively 
narrow angles (say with stage widths of less than 60-) , it 
is found that the value of r v is not critical providing 
that it lies between say 0.8 and 1.2, but that the value of 
r is an important predictor of image stability. For 
surround sound systems aiming to produce images at each 
side of a listener, however, making r v equal one accurately 
'35 at low frequencies becomes much more important, since the 
low-frequency localisation cue is one of the few cues that 
can be made correct for such side-stage images, and the 



30 



WO 93/18630 PCT/GB93/00429 



21 

accuracy of such localisation depends critically on the 
accuracy of r v . 

It has thus been found that the localisation criteria 
for front-stage stereo and for surround sound are somewhat 
5 different in their practical trade offs. 

For all methods of reproduction, it has been found 
that it is desirable that the two localisation azimuths 0 V 
and 6 E should be broadly equal, so that any decoding method 
should ideally be designed to produce speaker feed gains g f 
10 for all localisation azimuths such that 

* v = e t (id 

at least for frequencies up to around 3% or 4 kHz. This 
ensures that different auditory localisation mechanisms 
give broadly the same apparent reproduced azimuth, 

15 especially in those frequency ranges in which more than one 
mechanism is operative. Equation (11), which is an 
equation relating the quantities g f via equations (5) to 
(10), can be written in the form 

E x V y = E y V x (12) 

20 and is seen in general to be cubic in the gains g f . If 
equations (11) or (12) are satisfied, there is a tendency 
for illusory phantom images to sound more sharp and precise 
than if 0 V and 0 E differ substantially. 

However, sharpness is not the same as image stability, 

25 and additional requirements on r v and r e are necessary for 
optimum imaging stability. For surround sound systems, it 
is highly desirable, under domestic scale listening 
conditions, that 

r v = 1 d3) 
30 for all reproduced azimuths at low frequencies, typically 
under 400 Hz, at a central listening location. However, 
above 400 Hz, it is instead desirable that the value of r E 
be maximised. With some exceptions, it is generally not 
possible to design a reproduction system to be such as 
35 simultaneously to maximise r E in all reproduced directions, 
so that in practice, some design trade off is made between 
the values of r E in different reproduced directions. In 
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general, for surround-sound systems, r E above 400 Hz is 
designed to be larger across a frontal stage than in side 
and rear directions, but not to the extent that side and 
rear sounds become intolerably unstable. 
5 The optimisation of r E above about 400 Hz is partly a 

matter of design skill and experience obtained over a 
period of years, but some of this skill can be codified as 
informal rules of thumb. It is generally highly 
undesirable that l-r E should vary markedly in value for 
10 sounds at only slightly different azimuths, since such 
variations will cause some sounds to be much more unstable 
than other near by ones. In general, it is desirable that 
r be maximised at the due front azimuth or across a 
frontal stage, and it is desirable that the values of r E in 
15 other directions vary smoothly. 

A decoder or reproduction system for 360- surround 
sound is defined to be Mbisanic if, for a central 
listening position, it is designed such that 

(i) the equations (11) or (12) are satisfied at least 
20 up to around 4 kHz, such that the reproduced azimuth 0 ¥ - 
0 E is substantially unchanged with frequency, 

" (ii) at low frequencies, say below around 400 Hz, 
equation (13) is substantially satisfied for all reproduced 
azimuths, and 

25 (iii) at mid/high frequencies, say between around 700 

Hz and 4 kHz, the energy vector magnitude r E is 
substantially maximised across as large a part of the 360' 
sound stage as possible. 

In large reproduction environments, such as auditor la, 

30 it is unlikely that a listener will be within several 
wavelengths of a central listening seat; under these 
conditions, the requirement of equation (13) is desirably 
not satisfied, although it is still found that satisfying 
equations (11) or (12) gives useful improvements in phantom 

35 image quality. 

The Ambisonic decoding equations (11) to (13), plus 
the requirement for maximising r s above 400 Hz, are in 
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general a highly nonlinear system of equations. Prior-art 
solutions to these equations involved the use of 
loudspeaker layouts with a rather high degree of symmetry, 
e.g. regular polygons, rectangles, or involving 
diametrically opposite pairs of loudspeakers but the new 
solutions in accordance with the present invention apply to 
much less symmetrical speaker layouts. 

Previously, in finding solutions for the Ambisonic 
decoder solutions have been selected such that the 
reproduced acoustical pressure gain P, as defined above had 
a directional gain pattern as a function of encoded azimuth 
9 which was the same at low and high frequencies, apart 
from a simple adjustment of overall gain with frequency, 
in the presently described decoders, by contrast, the 
15 values of the decoding matrix is such that while the output 
from the speakers satisfies the Ambisonic decoding 
equations, different directional gain patterns for the 
pressure signal P result at different frequencies. A 
further characteristic of the decoding matrixes is that 
20 they result in the magnitude of the velocity vector r v 
varying systematically with encoding azimuth 6 rather than 
being substantially constant with azimuth as in previous 
Ambisonic decoders. In particular, r v is made 

substantially to track r E in a mid-high frequency range of 
25 e.g, 700 Hz to 4 kHz while in a low frequency range up to 
e.g 400 Hz r v is as far as possible equal to one. 

It is found that the use of a decoding matrix having 
these characteristics gives markedly improved image 
stability. Moreover, it is possible in addition to find 
solutions for a decoder to feed a loudspeaker layout 
incorporating additional central loudspeakers such as the 
five or six - speaker layouts illustrated in the Figures. 
The derivation of solutions for such layouts will be 
described in further detail below. 

One characteristic of the Ambisonic decoding matrices 
is that they increase the gain of those signals for which 
lowest, which will typically be the rear-stage 
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signals. Accordingly, to counter this effect, the decoder 
is arranged to apply a forward dominance transformation to 
the B-format signal W,X,Y. This is a Lorentz 

transformation which produces transformed signal components 

F/'=l/2 U+X- 1 ) tf+8" 1/2 U-X-DX 

5 

(*♦*-*) r*-*" 

Y'=Y 

...(3) 

W X' Y' satisfying the above equation where X is a real 
10 parameter having any desired positive value. 

It follows from the above relationship that a due- 
front B-format sound with W,X,Y gains of 1, 2* and 0 
respectively is transformed into one with a gain X times 
larger whereas a due-rear sound with original gains 1,-2 
and 0 respectively is transformed into a rear sound with 
gain multiplied by X" 1 . Thus this forward dominance 
transformation increases front sound gain by factor X 
whereas it alters rear sound gains by an inverse factor 1/X 
and the relative gain of front to back sounds is altered by 
a factor X 2 which allows the relative gain of reproduction 
of rear sounds to be modified to reduce (or increase) their 
relative contributions. 

This use of forward dominance control is important in 
various applications of B-format to HDTV. In a production 
application, it can be used to de-emphasise sounds from the 
rear of a sound field microphone while still giving a true 
B-format output. However, it can also be used in 
different reproduction modes relying on B-format input 
signals to de-emphasise rear sounds. In particular, in the 
new B-format Ambisonic surround-sound decoders of the 
• present invention which may otherwise give excessive gain 
for rear sounds can be compensated for by a judicious 
application of a compensating forward dominance. 
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Besides altering the front-to-rear level balance, 
forward dominance also alters the directional distribution 
and azimuths of sounds (other than those at due front and 
directions). Figure 4 shows the effect of forward 
dominance with X=A Without going into the detailed 
analysis, it can be shown that an original azimuth 8 is 
transformed into a new azimuth 6' given by the equation 



, u+cos6 
cose/= i%cos9 



...(4a) 

10 where 

It = (X 2 -D/(X 2 +1). 1 ' 

If X >1, then all directions are moved towards the front, 
and if X<1, all directions are moved towards the back by 
the forward dominance transformation of equation (3) . The 

15 width of a narrow stage around due front is multiplied by 
a factor 1/X , and of a narrow stage around the back is 
multiplied by a factor X , as shown in Figure 4. by this 
transformation, so that forward dominance is a kind of B- 
format "width control" that narrows the front stage as it 

20 widens the rear stage, or vice-versa. The relative front- 
to-back amplitude gain X 2 , expressed in decibels, is termed 
the ..dominance gain", so that X = 2* is said to have a 
dominance gain of + 6.02! dB. This dominance gain causes 
images at the sides (azimuths ±90') to move forward by an 

25 angle of sin" » = 19.47- in the B-format sound stage, via 
eauation (4) - 

Although for simplicity the forward dominance 
transformation may be considered as a separate operation 
carried out on the input W,X,V signals, before the 
30 transformed signals are applied to the decoding matrix, in 
practice both the transformation and the decoder may be 
■ carried out by a single matrix. 

considering now in detail the derivation of the 
coefficients for the decoding matrix in the enhanced 
35 Ambisonic decoders, Figures 6-9 show typical speaker 
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iayouts which will be considered for 360° surround-sound 
reproduction. Figure 6 shows a rectangular speaker layout 
using left-back L,, left-front L f , right-front R F and nght- 
back Rs speakers at respective azimuths 180°-*, *, -« and - 
5 180°+*, supplemented by an extra centre-front C F 
ioudspeaker. Figure 7 shows a similar 5-speaker layout, 
except that now the azimuth angles *, of the front pair 
differs from that 180° ± * B of the rear pair, so that the 
L F , R f and R, speakers form a trapezium layout. 
10 Figures 8 and 9 show similar rectangle and trapezium 

speaker layouts respectively, but this time supplemented by 
a frontal pair of speakers C L and C R at respective azimuths 

+*. and -* c . , 

There is already a long-known Ambisonic decoder for B- 

15 format for the 4-speaker rectangular layout shown t n 
Figures 6 or 8, for which * v = 6 e = 6 for all encoded 
azimuths 6. However, this decoder has identical r E for due 
front and due back sounds above about 400 Hz, and for 
square loudspeaker layouts has r E equal to 0.7071 in all 

20 directions, which is not adequate for frontal-stage sounds 
for use with TV. However, it is possible to show that for 
the rectangular layouts of Figures 6 and 8, there are other 
fcnbisonic decoders that feed the additional frontal 
speakers so as to increase r E for front-stage sounds, at 

25 the expense of slightly decreasing r E at the sides and 

rear. . , 

Although the speaker layouts of Figures 6 to 9 lack a 
high degree of symmetry, they are still left/right 
symmetrical, i.e. symmetrical under reflection about the 

30 forward direction. We assume here that we are considering 
a left/right symmetric shaker layout in which all speakers 
lie at the same distance from a listener. We seek to find 
for the various speaker layouts of this kind those real 
left/right symmetrical linear combinations of the B-format 

35 signals W, X and Y such that the equations 

e v = e, = e < 25 > 
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are satisfied for all encoding azimuths 6 in the 360* sound 
stage. Having found all such solutions, the next step is 
to find among those solutions ones with r v = 1 for low 
frequencies and those with maximised r E at higher 
5 frequencies, and to use a frequency-dependent matrix to 
implement these two matrices in a frequency-dependent 
manner as an Ambisonic decoder. The decoder architecture 
we now describe, and the associated methods of solution 
described in Appendix A works for quite general left/right 
10 symmetric speaker layouts, although the numerical details 
of the solution process can be extremely messy in 
particular cases, requiring the use of powerful computing 
facilities. 

In order to take advantage of left /right symmetry, it 
15 is convenient to express the speaker feed signals 
illustrated in Figures 6 to 9 in sum and difference form as 
follows: 

L F = M F + S f 
R F = M F - S F 

20 Lb = ^ + s b 

Re = 1^ - S 8 



C F + S c 



(26) 



25 Because of the left /right symmetry requirement, at any 

frequency one can write the signals C F , S c , M F , S F , Mb and S B 
in terms of B-format in the following form: 

S c - k c Y 

S F =* k F Y 

30 S B = k B Y , 

C f = a c W + b c X 
M F - a F W + b F X 

M B = a B W - b B X , < 27 > 

35 where k c , k F , k B , a c , a F , a B , b c , b F , b B are real coefficients 
(which typically will all be positive, excepting k c which 
may be zero) . 
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Figure 10 shows the general architecture of an 
Ambisonic decoder for the speaker layouts of Figures 6 to 
9 based on equation (27). At the B-fonnat input, there is 
provided optionally a forward-dominance adjustment 

5 according to equation (3) so that the relative front/back 
gain balance and directional distribution of sounds can be 
adjusted. Each of the three resulting B-format signals is 
then passed into a phase compensated band-splitting filter 
arrangement, such that the phase responses of the two 

10 output signals are substantially identical. Typically for 
domestic listening applications, the cross-over frequency 
of the phase-compensated band-splitting filters will be 
around 400 Hz, and the sum of low and high frequency 
outputs will be equal to the original signal passed through 

15 an all-pass network with the same phase response. For 
example, the low-pass filters in Figure 10 might be the 
result of cascading two RC or digital first order low-pass 
filters with low frequency gain 1, and the high-pass 
filters might be the result of cascading two first order 

20 high pass filters with the same time constants, with high 
frequency gain of -1; these filters sum to a first order 
all-pass with the same time constant, and have identical 

phase responses. 

The low-frequency B-format signals resulting are fed 

25 to a low-frequency decoding matrix to implement equation 
(27) for coefficients appropriate below 400 Hz (typically 
ensuring that r v = l) , and the high-frequency B-format 
signals are fed to a second high-frequency decoding matrix 
to implement equations (27) ' for a second set of 

30 coefficients appropriate to the higher frequencies at which 
r is to be maximised. The resulting low and high 
frequency signals C F , s c (where it exists), M f , S„ M, and 8, 
are then summed together and fed to output sum and 
difference matrices to provide speaker feed signals 
35 suitable for the speaker layouts of Figures 6 to 9. In the 
case of 5-s P eaker layouts such as those of Figures 6 or 7, 
or in the case of 6-speaker layouts in the case that C t = 
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C F = C F , the S c signals path and the top sum-and-dif ference 
matrix in Figure 10 may be omitted. 

The use of phase compensation (i.e. phase matching) of 
the band-splitting filters in Figure 10 is found to be 
highly desirable for surround sound decoders, since any 
•-phasiness" errors due to relative phase shifts between 
signal components are magnified by the large 360» angular 
distribution of sounds, although in some cases, the use of 
filters that are not phase matched may prove acceptable. 
It is also clear that the architecture of Figure 10 can be 
extended to 3 or more frequency bands by using a three-band 
phase-splitting arrangement with three decoding matrices, 
so as to optimise localisation quality separately in three 
or more bands. Typically a three band decoder might have 
15 crossover freguencies at 400 Hz and at or around 5 to 7 kHz 
so as to optimise localisation in the pinna-colouration 
frequency region above about 5 kHz. 

It is also evident that, rather than bandsplitting 
into, say, low and high frequency bands as in Figure 10, 
20 other bandsplitting arrangements can be used, e.g. an all- 
pass path feeding high frequency decoding matrix 
coefficients and a phase-matched low-pass path feeding a 
decoding matrix whose coefficients are the difference 
between the low and high frequency coefficients. 
25 Similarly, part or all of the output sum and differencing 
process might be implemented in the decoding matrices 
before the band-combining summing process. Such variations 
on the architecture shown in Figure 10 are relatively 
trivial practical modifications that would be evident to a 

30 skilled designer. 

in particular, the forward dominance adjustment might 
be implemented directly as modified coefficients a c , b c , a F , 
b F , a B , b B rather than or in addition to an input forward 
dominance matrix. 

Besides possibly implementing forward dominance and 
overall gain adjustments, the decoding matrices in Figure 
10 will, in general, have matrix coefficients that vary 
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with the speaker layout in use, so that a typical Ambisonic 
decoder implemented as in Figure 10 will have a means of 
causing the matrix coefficients to be altered in response 
to the measured or assumed speaker layout shape and angles 
5 shown in Figures 6 to 9. This may be done by a 
microprocessor software adjustment of coefficients, or by 
manual gain adjustment means. 

Appendix A below describes a general method for 
finding decoder solutions having the properties discussed 
10 above and Table l lists the values of the matrix 
coefficients for a given layout, and also describes the 
performance of the decoder in the different high and low 
frequency domains. Appendix B goes on to describe specific 
analytic solutions for particular layouts and Appendix C 
15 and Table 2 describe the low and high frequency solutions 
for nine different 5-speaker layouts. 

As noted in the introduction above, as well as 
providing an inherently improved front-stage image 
stability, the Ambisonic decoders of the present invention 
20 also provide a suitable basis for an enhanced decoder 
including additional channels providing improved stability 
of and separation between the front and rear stages. At 
the very simplest, in such an enhanced system one can add 
one additional channel signal denoted by E which 
25 incorporates a feed for a front loudspeaker. Such an 
isolated centre front signal has been found to be important 
in film and HDTV applications, in that typically dialogue 
and other sounds from the centre of the screen are more 
important than any other directions, and experiments in 
30 using Ambisonics plus a front-centre speaker feed have 
confirmed that such a method also works well in cinema 
applications. However, having only a single sound position 
that is highly stable proves rather inflexible and unsubtle 
for many applications. Nevertheless, such an added E 
35 channel in combination with B-format signals can yield 
useful benefits. A second added channel F can be used 
largely to cancel front-to-rear stage cross talk (which is 



WO 93/18630 



PCI7GB93/00429 



10 



31 

largely due to the Y signal) and to widen the frontal 
stage. In combination with E and the three B-fonaat 
signals, the F signal gives a frontal stage reproduction 
closely approximating 3-channel frontal stereo. Any sounds 
assigned to such a high-stability frontal stage should also 
be encoded conventionally into the three B-format signals 
so that users discarding the E and F signals will still get 
B-format reproduction incorporating those sounds. 

in view of these considerations, the present example 
provides a decoder for an enhanced B-format comprising up 
to 5 signals W,X,x, E and F for studio production 
applications in horizontal surround-sound with enhanced 
frontal image stability. This encodes signals from azimuth 
6 into the five channels with respective gains 
15 W with gain 1 

X with gain 2*cose 
Y with gain 2*sin0 

E with gain k e (l - 3.25(1 - cos*)) for |0| <6 S 

and gain 0 for \6\ > 6 S 
F with gain 2*k, sinfl for \6\ «S 0 S , 
gain -2*k b sinfl for I 18O o -0l < V 
and gain O for other 0 
where fl s is a frontal stage half width, typically between 
60- and 70", B e is the rear half stage width, typically 
around 70» and the gains k« and k f may be chosen between 
zero (for pure B-format) and a value in the neighbourhood 
of or equal to one (for reproduction effect purely in the 
front and rear stages). The co-efficient 3.25 may be 
subjected to slight changes in value somewhere between 3 
30 and 3.5. Enhanced B-format thus allows, by variations of 
the gains k e , k f , and k, (which should preferably be roughly 
equal) , the assignation of frontal stage sounds anywhere 
between pure B-format and positioning in the front and rear 
stages. As a production format, it allows reproduction in 
35 a large variety of different modes. 

For Ambisonic reproduction via 5 or 6 loudspeakers, 
Figure U shows a typical architecture for decoding 
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enhanced B-format signals incorporating an Ambisonic 
decoding algorithm as described earlier for pure B-format 
signals. Across the frontal stage for k e = k, = 1, it will 
be seen that F = Y and it will further be seen that for 
5 front centre sounds, W = E and X =2*E. Thus the signals 

W-E 

X-2*E 

and 

(54) 

Y - F v 
10 equal zero and cancel for due front sounds, and Y - F 
continues to cancel across the rest of the frontal stage, 
whereas, as 0 increases towards 60- and the gain of E falls 
to zero and then becomes negative, the other two of the 
signals of equation (54) become large, but in a manner that 
15 causes very little output from rear speakers. 

Thus the first step in an Enhanced B-format Ambisonic 
decoder is to derive the "cancelled" signals of equation 
(54) to feed a conventional 5- or 6- (or greater) speaker 
B-format Ambisonic decoder, and to take the E signal and to 
20 feed it with an appropriately chosen gain via a phase- 
compensating all-pass network (to match the filter networks 
in the Ambisonic decoder) to feed centre front loudspeakers 
directly. 

For azimuth zero sounds with k e = 1, this gives ideal 
25 localisation of centre front sounds. For sounds at other 
azimuths, the change of the sign of E's gain towards the 
edges of the frontal stage mean that the directly fed E 
signal now tends to cancel the centre-front speaker feeds 
deriving from the output of the B-format Ambisonic decoder, 

30 leaving largely just the front left and right speaker 
feeds. If one then provides the frontal speakers with a 
multiple of the F signal (passed through another phase- 
compensating all-pass network) as a left/right difference 
signal, the width of this largely frontal stage 

35 reproduction can be given a desired degree of left/right 
separation. 



WO 93/18630 



PCT/GB93/00429 



33 



10 



15 



By this ineans, the architecture of Figure 11, with 
initial "cancellation" of the enhancement channels E and F 
from the B-format signals before these are Ambisonically 
decoded, and the provision of direct speaker feed signals, 
via phase compensation networks, from E and F, can provide 
substantially conventional 3 -speaker stereo from frontal- 
stage sounds with k e = k f = 1, with relatively low crosstalk 
onto rear speakers, provided that the Ambisonic decoder 
design is a type having additional frontal stage speakers 
of the kind described above such as in Figures 6 to 10. 
The cancellation by E of a central speaker feed for encoded 
azimuths near ± 60° can be adjusted for a given decoder 
design by a careful choice of the direct speaker feed gains 
of the E signal. In particular, while Figure 11 shows the 
E and F signals as being simply fed forward and mixed into 
the C F , S c and S F signal paths in a manner that is (apart 
from phase compensation) independent of frequency, in a 
practical design, a judicious feed of a small amount of E 
signal to the M F and M„ signal paths, and of the F signal 
to the S B signal path in small amounts, possibly with a 
frequency dependence in the gain, can yield a small but 
useful improvement in the overall performance of front- 
stage stereo sounds. 

It will this be seen that the diagram of Figure 11 
illustrates the structure- of an enhanced B-format decoder 
only in its most basic form, and that slightly more complex 
direct feeds of the E and F signals, with the dominant 
components feeding respectively C F and S c and S F may be used 
to optimise front-stage performance, possibly using gains 
30 that vary somewhat with frequency. 

In typical 5-speaker decoders, it is found that the 
gain of the E signal fed to C F is typically around g E = 2, 
and the gain f, of the F signal fed to S F is typically 
around 1 to ensure broadly "discrete" frontal 3-speaker 
stereo. These figures vary somewhat with the Ambisonic 
decoder design and speaker layout. 
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The function of the E signal is to increase the 
"separateness" of the frontal speaker feeds, especially 
that of centre-front, whereas the F signal has the effect 
of cancelling out the left/right difference signal f ron the 
5 rear speakers and increasing it at the front, thereby 
converting signals from true Ambisonic surround-sound 
signals to ones dominant ly reproduced from a frontal stage. 

The gains k e and k f that give predominantly discrete 
speaker feeds at the front are around 1, and if one wishes 
10 to keep rear speaker levels low for frontal stage sounds, 
it is desirable to put It, - 1. However, in general, an 
improved localisation quality of phantom front-stage images 
is typically achieved not with k e = l, but with k e having 
a value near 0.4 or 0.5, as is shown by computed values of 
15 0 V and 6 e for decoders of the form of Figure 11. 

The design of the best direct gains for the E and F 
signals for each B-format Ambisonic decoding design, for 
each speaker layout, is a matter of subjective tradeoffs of 
different aspects of frontal-stage localisation quality by 
the designer, and does not form a strict part of the system 
standards for enhanced B-format, but rather a decoding 
option that may be varied within quite wide limits. It is, 
of course, necessary to ensure that reasonable results can 
be obtained, and the basic architecture of Figure 11 based 
25 on the 5- or 6-speaker Ambisonic B-format decoders 
described in this specification, or its minor modifications 
suggested above, does broadly achieve the desired results 
of enhanced frontal-stage image stability very similar to 
the use of separate frontal-stage stereo transmission 
channels, while still incorporating full B-format surround 
sound signals in an economical manner. 

While the above has explained how decoders using 
additional channels E and F similar to those shown in 
Figure 11 provide a greater "discreteness" and separation 
of front-stage azimuthal sound with I 6\ Z 6 S , the same 
method also reduces rear-to-front stage crosstalk across 
the rear stage azimuths I 180»-fil < 6 t when k b has a value 
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near one. This is because the subtraction of F from Y in 
such a rear stage has the effect of doubling the gain of Y, 
thereby increasing the left/right difference signal across 
both front and rear stages by a factor 2, and the addition 
5 of substantially F to the front stage difference signal 
cancels out the contribution F and Y to the front stage. 

It will be appreciated that instead of subtracting F 
from Y at the input stage of the decoder of Figure 11, it 
may instead by preferred to add or subtract the F signal, 

10 after passage through a phase-compensating network to match 
the phase of the bandsplitting filters, with various 
coefficients directly to the signals S c , S F , and S B so as to 
achieve a similar effect. Similarly, the subtraction of E 
at the input stages of Figure 11 may be replaced, in whole 

15 or in part, by appropriate additions or subtractions of 
multiples of E, after phase-compensation filtering, to C F , 
Mp and Mg . 
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The invention may also be applied to signals encoded 
by methods other than B-forraat, and in particular to a 
directionally encoded signal conveyed via two channel 
encoding, such as the two-channel surround-sound 

5 systems known as UHJ, BMX or regular matrix. One 

example of such an ambisonic decoder according to the 
invention for a rectangular loudspeaker layout 
illustrated in figure 12 is now described, although the 
methods here may be applied to more complicated 

10 loudspeaker layouts also. 

One method of encoding sounds assigned an azimuthal 
direction 9 into two audio signal channels is that used 
in the BMX system of encoding/ whereby a first signal M 

15 is encoded with gain 1 for all azimuths, and a second 
signal S is encoded with complex-valued gain 

eJ G = cos© + jsin© (XI) 
where j = y 7 -! represents a relative 90° phase shift or 
Hilbert Transform. The 2-channel example described 

20 here vill be described in terms of BMX encoding, 

although it will be realised that similar methods apply 
to other 2-channel encoding methods. 

The psychoacoustic localisation theory described 
earlier can be applied to loudspeaker signals with 

25 complex gains rather than real gains gj by putting 
rycosOy = Re[v x /P] 

r v sinO v = Re[v y /P] < x2 > 
using the same notations for P, V x and V y in equations 
(5), (6x) and (6y) earlier, where Re means "the real 

30 part of". r v and ©v * n this case have similar 

psychoacoustic interpretations as in the case that gj 
are all real gains. The equations (8) to (10) above 
may still be used to compute the localisation parameters 
r E and 0 E as before, and the equations (11) and (13) 

35 still define the desirable ambisonic decoding equations 
that.ve ideally wish a decoder to satisfy. 
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A new factor that mu9t be considered for decoders 
with complex speaker feed gains is the perceived 
"phasiness" of signals. Phasiness is an unpleasant 
subjective effect caused by phase differences between 

5 loudspeakers, which causes broadening of illusory sound 
images, an unpleasant change in perceived tonal quality, 
and an unplesant "pressure on the ears" sensation. For 
a forward-facing listener, the degree of phasiness 
effect may be quantified by the quantity 

10 q = Im[V y /P] (X3) 

where Im means "the real coefficient of the imaginary 
part of". 

While subjective sensitivity to phasiness varies 
among individual listeners, it is found that the effect 

15 is objectionable if the magnitude of q exceeds about 0.4, 
and is usually acceptable if the magnitude of q is less 
than about 0.2. It is further found that generally, 
phasiness is more objectionable for frontal stage sounds 
than for rear stage sounds, so that it is generally 

20 preferred that decoder designs be biased to producing a 
frontal stage phasiness of less than 0.2 magnitude, even 
if this should mean a quite large phasiness in the rear 
stage . 

In the previous art described by one of the 
25 inventors' British patent number 1550627, a means was 

described of reducing phasiness for 2-channel ambi sonic 
decoders using rectangular or other loudspeaker layouts 
across a frontal stage, at the expense of increasing it 
across a rear stage. The present invention, applied to 
30 2-channel ambisonic decoders, allows a lower phasiness 
and an increased value of r E to be achieved across the 
surround sound stage than was possible vith this 
previous art . 

As noted earlier, in the previous ambisonic decoder 
35 art, the reproduced pressure signal P was substantially 
a single signal subjected only to a . shelf filtering 
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process to meet the requirements of lover and higher 
frequency localisation, whereas the present decoder uses 
a pressure signal P whose polar diagram varies 
substantially with frequency, thereby achieving at 
higher frequencies, a value of r v that is not constant 
with encoded azimuth, but which instead roughly tracks 
the variation of r E with encoded azimuth. 

Denoting the respective rear left, front left, 
front right and rear right loudspeaker feed gain by 
the symbols L B , t,p, R F and R B as in figure 12 1 and 
denoting the respective loudspeaker azimuths with 
respect to a central listener by 180°-rf, 4, -rf and 
-180°+rf, where 6 is the half-width of the stage 
subtended by the front speaker pair, one can design 
decoders for this speaker layout as follows. First 
write 

*d = ^b^F^F+Kb) 
X<3 = ^(-L B +L F +R F +R B ) 

Y d = ^(Lb^f-Rf-Rb) 

Fd = ^(-L B +L F -R p +R B ). (x4) 
Then it can be shown that 9 V = if on e puts F d = 0, 
and that in this case 

r v cos© v = Re(X d /W d )cosrf 

r v sin© v = Re(Y d /W d )sinrf 

g = Im(Y d /W d )sinrf ( X 5) 
and that further 



Moreover, in this case that F d * 0, it can be shown that 



30 



R F 

Rb 



^(W d -X d +Y d ) 
^(Wd + X d +Y d ) 
^(W d +X d -Y d ) 
*g(W d -X d -Y d ) . 



(X6) 



P = 2W d , V x = 2X d cosrf, V y = ?Y d sinrf 



(X7) 



and that 



35 



E ■ l«dl 2 ♦ |X d |2 + |Y d |2 
E x = 2(cosrf) Re(X d W d *) 
E y = 2(sinrf) Re(Y a W d *) , 



(X8) 
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where * Indicates complex conjugation. From these 
results, the "psychoacoustic localisation parameters" 
r V r E' ^V =Q E and ^ can be computed via 
r v cos0 v = %( E x / |W d | 2) 

rysinOy = %(E y /|w d |2) ( X 9) 
so that 

r v = (E x 2+E y 2)V(2|w d |2) (xiO) 
r E = (2|w d |2 )rv / E = 2r v /( 1 ♦ |^| + |^| ). (Xll) 



Figure 13 shows an example of an ambisonic decoder 
for a rectangular speaker layout, which provides signals 

X d , and Y d derived via phase-amplitude matrices 
from input signals M and S, in two separate signals 

15 paths at low and at high audio frequencies (typically 
with a cross-over frequency around 400 Hz) produced 
from the input signals via phase-compensated band 
splitting filters as described earlier in connection 
with figure 10. Such low and high frequency signals 

20 w d , X d and Y d may then be fed to an output amplitude 
matrix, such as in equ. (X6) above, to derive output 
loudspeaker feed signals suitable for a layout such as 
shown in figure 12 . 

By using two phase-amplitude matrices for the 

25 two audio frequency ranges, it is thereby possible 

to optimise r v to equal 1 in the low frequency range and 
to use a different matrix so as to maximise r£ and to 
minimise the effects of phasiness q in the higher 
frequency range. It will be appreciated that the band 

30 splitting filters need not preceed the phase amplitude 
matrices, but may alternatively follow them or be 
placed in the middle of the signal path of the matrices. 
In particular, in practical implementations, it is often 
convenient for the relative 90° phase difference 

35 networks, which are relatively complex and which form a 
part of any phase-amplitude matrix, to precede the 
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bandsplitting filters. o 

Also shown In figure 12 is an optional "forward 
dominance" adjustment, which in general will differ 
from that for B format given earlier, but which performs 

5 a similar function of altering the gains and azimuthal 
distributions of different encoded azimuth directions 
while maintaining the characteristics of the particular 
"locus of encoded directions characterising the 
directional encoding scheme for which the decoder is 

10 designed. 

In the prior art, as described in British patents 
1494751, 1494752 and 1550627, there is a known solution 
to the ambisonic decoding equations for which 9 = 0 V 
= © E for BMX, which may be characterised in terms of 

15 the above signals w d , x d and Y d as follows. Put 

x = cosO and y = sinO, so that eJ G = x * jy. Then 
the prior art solution is 
W 6 = k 0 l = k 0 H 

X d = k 1 (x+jy)/cos«J = ktS/cosrf 
20 Y d = [kj(y-jx) + k 3 jl]/sin*f 

= [-kijS + k 3 jM]/sinrf , (X12) 
where the realgain constants kn< *i and k 3 are frequency 
dependent, with ko = k t at low frequencies to ensure 
that r v = 1# and with k x equal to between 0.5 and 0.7 
25 times kn at high frequencies, with k 3 equal to about 
to help maximise r£ and minimise phasiness q. 
It will be noted in particular that in this prior 
art solution, in each frequency range r v is independent 
of encoded azimuth 0, being equal to k^/kQ, and that 
30 the pressure signal P = 2W d is simply the signal M 
subjected to a frequency-dependent gain kn* without 
any variation of polar diagram to encoded azimuth 0 
as frequency varies - 

We have found new solutions of the ambisonic 
35 decoding equations for BMX for which 9 = 0 V = ©e- 
These have the form 
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W d = t\H ♦ t 2 S = tx 1 * t 2 (x*jy) 

X d = k^-^M ♦ tjSj/cosrf = k 1 (-t 2 l*t 1 (x*jy))/cos<i 

Yd = (-k 2 jS+k 3 jM)/sinrf = [k 2 (y-jx)+k 3 j]/sin* , (X13) 

where t\, t 2 , k] , k 2 and k 3 are 5 real parameters 

5 chosen such that 

r v cos© v = k 1 (t 1 2-t 2 2 )x/(t l 2 + t 2 2^2t 1 t 2 x) = r v x (X14) 

and 

r v sinO v = (k 3 t 2 +k 2 t 1 )y/(t 1 2 *t 2 2 +2t 1 t 2 x) = r v y, (X15) 

so as to give 0 = ©v = ®E* 

10 This is ensured by requiring that 

*3t 2 + K 2M = MM 2 - t 2 2 ) . <X16) 
With this equation, the new BMX solution to the 
ambisonic decoding equation © = © v = °E has four free 
real parameters , one of which merely represents the 

15 overall reproduced gain. If t 2 i 0, these solutions 
differ from the prior art, and if the ratio of t 2 to 
t\ varies with frequency, the resulting decoder has 
a pressure signal P whose polar diagram varies with 
frequency and such that r v varies with azimuth O. 

20 Solutions with r v = 1 are given whenever 

t 2 = 0, kj = 1 and k 2 = tj , (*17) 
for any choice of K 3 , and such solutions are apt to 
the low frequency audio region as explained earlier. 
At high frequencies, it is found to be better that 

25 t 2 i 0. Typically, the pressure signal gain P = 2W d 

is chosen such that at low frequencies it tends to have 
an omnidirectional polar pattern, but at high 
frequencies it is more sensitive to the back than to 
the front- 

30 The phasiness of this decoder is given by 

q = [(k 3 t 1 -k2t 2 )+(k 3 t 2 -k 2 t 1 )x]/(t 1 2 +t 2 2+2t 1 t 2 x). (X18) 
We have found that the following values give a BMX 
decoder vith excellent high frequency performance: 
t, = 1, t 2 = -0.15, k t = 0.5, k 2 = 0.53915, k» * 0.3360 (X19) 
35 which satisfy the equation (X16) above, and also ensure 
that q r o for 0 = +45° azimuth, thereby helping to 
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minimise phasiness across a frontal azimuthal stage. 

The values of the localisation parameters and 
total reproduced energy gain for this high frequency 
BMX decoder for various encoded azimuths 9 are given 



10 



in the 


following 


table for 


a square 


speaker 


9=9 V =9 E ry 


q 


r E 


gain dB 


00 


0.8765 


-0.2390 


0.6686 


1.66 


45© 


0.6031 


0.0000 


0.5675 


2.36 


90° 


0.4780 


0.4077 


0.4176 


3.69 


135«> 


0.3659 


0.6753 


0.3303 


4.71 


1800 


0.3666 


0.7610 


0.3040 


5.07 



with similar results for negative azimuths because 
of left/right symmetry. 

1 5 Compared io prior art BMX decoder* via square loudspeaker layout!, this 
decoder gives a lower rear-stage phasiness for given values of r E . 

Note from the above table that 
the values of r E and ry more-or-less track as azimuth 
varies, which we have found to be generally desirable 

20 for good performance in arabisonic decoder designs. 

It will be noted that the reproduced gain in the 
above table varies with azimuth. As in the earlier 
B-format examples of the invention, a suitable 
forward dominance transformation may be used prior to 

25 decoding, as shown in figure 13, to compensate for such 
gain variation. A forward dominance transformation that 
preserves BMX encoding is given by the amplitude matrix 
transformation 

M* = %(l+w)M «• ^(l-w)S 

30 S' = ^(l-v)M + ^(l+w)S , 

where w is a positive parameter. This has the effect 
of leaving 0=0° sounds vith unchanged gains, but of 
multiplying the amplitude gain of rear G = 180° sounds 
by a factor w, and also of altering encoded azimuths © 

35 to a new azimuth G 1 given by the formula equ . (4a) 
where is now given by: 
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p = (l-i?)/(l*v) . (X21) 
As in the B-format case, the forward dominance matrix 
may be combined with the phase-amplitude matrices. 

The above BMX decoder is not only applicable to use 
5 with rectangular loudspeakei layouts, since the output 
amplitude matrix in figure 13 may be replaced with 
alternative output amplitude matrices described in the 
prior art in British patents 1494751, 1494752, 
1550627 and 2073556 for regular polygon, regular 
10 polyhedron or other loudspeaker layouts comprising 
diametrically opposed pairs of loudspeakers. 

While BMX encoding has been used in the above 
description for ease of description, the same decoding 
method can be used with other 2-channel directional 
15 coding methods, and in particular with conventional 

amplitude-panned stereophony. In this case, left and 
right speaker feed signals L and R may be converted 
into BMX signals M and S suitable for the above decoder 
by means of a phase-amplitude matrix 
20 M = ( L+R ) + jw(L-R) 

S = (L+R ) - jw(L-R) (X22) 
or by a phase-amplitude matrix 
M = (L+R) - jw(L-R) 

S = (L+R) + jw(L-R) , (X23) 
25 where v is a positive stage width parameter which may 

be predetermined or adjustable by the user. Again, this 
matrix may be combined with the phase-amplitude matrices 
shown in f igure 13 . 

The above-described decoder for conventional 
30 stereophony may also be used with signals encoded for 
the Dolby 2-channel cinema encoding method with 
advantageous results, and for signals encoded for 
regular matrix encoding. 

While., for simplicity of description, the 2-channel 
35 example of the invention has been discussed only in 
connection with regular loudspeaker layouts, more 
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complicated loudspeaker layouts such as those shown in 
figures 6 to 9 may also be used, in which the signals 
P, V x and V y have the forms already given in 
connection with equation (X13) and (X16) for BMX, i.e. 
5 such that equ. (X16) hold and such that 

P = 2[ti+t 2 (x+jy) ] 

V x = 2k 1 (-t 2 +t 1 (x+jy)) 

v y - 2[k 2 (y-jx) ♦ k 3 j] (x24) 
and in which other linear combinations of loudspeaker 
10 feed signal gains are adjusted so as to ensure that 
6 E = ©v By this means, decoders for 2-channel 
surround-sound encoded signals may be devised which 
feed 5- or 6-speaker layouts such as shown in figs. 
6 to 9 and which satisfy the ambisonic decoding 

15 equations. 

Figure 14 shows the block diagram of a typical 
2-channel decoder satisfying the ambisonic decoding 
equations and capable of feeding five or six 
loudspeakers arranged as in figures 6 to 9. It win 

20 be seen that such a decoder is broadly similar to the 
8-format decoder of figure 10. except that it is fed 
by a 2-channel encoded signal, which is then fed to 
a first phase amplitude matrix which provides four 
output signal components W 2 , X 2 , Y 2 and B 2 which 

25 typically represent "pressure" , "forward component of 
velocity", "leftward component of velocity" and 
"pressure phase shifted by 90©- signals, and these four 
signals are then bandsplit via phase compensated low and 
high pass filters and fed into respective low- and 

30 high-frequency amplitude matrices. The outputs of these 
matrices are then handled in an identical manner to that 
already described in connection with the later stages 

of figure 10. 

The four signal components are, in typical 
35 implementations, related by B 2 being a nonzero imaginary 
multiple of W 2 and by Y 2 being a nonzero imaginary 
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multiple of X 2 , and by being such that W 2 and X 2 are 
"left/right symmetric" encoded signals in the sense that 
their gains as a function of encoded azimuth 9 satisfy 

W 2 (-9) = [W 2 (G)J* 
5 X 2 (-0) = [X 2 (Q))*, < x25 > 

typically having the form 

W 2 (0) = ai+a 2 cos9 + a3jsinO 

X 2 (o) = b 1 +b 2 cos© + b 3 jsinG , (*26) 
where a x , a 2 , a 3 , b x , b 2 and b 3 are real coefficients. 
10 For example, in the BMX case, one may have 

W 2 = M, X 2 = S, Y 2 = -jS, B 2 = jM . <X27) 
For such signals W 2 , X 2 , Y 2 and B 2 , the low- 
and high-frequency amplitude matrices in fig. 14 will 
then have the form 
15 c F = a c W 2 + b c X 2 

Mp ■ a F W 2 + b F X 2 
M B = a B W 2 + b B X 2 
S C = k c Y 2 ♦ 1 C B 2 
Sp = k F Y 2 + 1 F B 2 
20 S B = k B Y 2 + 1 B B 2 , ( x28 > 

where a c , b c , a F , bp, a B , b B , k c , l c , kp,.l F , k B and 
1 B are real coefficients, by analogy with equations (27) 
in the B-format case, and where the output amplitude 
matrix in figure X$ is given by equations (26) as in the 
25 B-format case. 

It will thus be seen that in the two channel case, 
• the broad architecture of a decoder satisfying the 
ambisonic decoding equations is similar to the three 
channel B-format case, except that an input phase- 
30 amplitude matrix produces four signal components to be 
processed rather than three. Because much of the 
signal processing is similar, large parts of the signal 
processing circuitry or algorithm may be common to use 
for decoding from different 2-channel and 3-channel 
35 sources. 
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We now indicate how new decoder solutions according to 
the invention may be derived for 2-channel directional 
encoding systems other than EMX, including the UHJ 
system. We consider systems of encoding sounds into a 
5 360 degree range of direction angles G, where typically 
and for convenience of description, 9 is measured 
anticlockwise in the horizontal plane from the forward 
direction. 

10 Such systems encode directional sound into two 
independent linear combinations (for example the sum L = 
X+A and the difference R = X-A) of signals I and A with 
respective gains 

I = a + bx + jcy 

15 A = jd + jex + fy (X-29) 

where a,b,c,d,e,f are real coefficients and where x - 
cosG and y - sinG. For example, in the UHJ 2-channel 
encoding system described in M.A. Gerzon, "Ambisonics in 
Multichannel Broadcasting and Video", J. Audio Eng. Soc, 

20 vol. 33 no. 11, pp. 859-871 (1985 Nov.), one has 
a = 0.9397, b = 0.2624, c = 0 

d - -0.1432, e - 0.7211, f = 0.9269. (X-30) 

Since such directional encoding systems use only 2 
25 channels, all signals used in decoders are complex linear 
combinations of just two signal components, which we 
shall denote as W 2 and X 2 analogous in the general case 
to signals with gains 1 and x+jy in the special case 
already described of BMX. The analogous signals W 2 and 

30 X 2 are conveniently chosen to be those signals used for 
the pressure and forward- facing velocity components of a 
2-channel decoding system disclosed in one of the 
inventors British patent 1550628 for a system with 2- 
channel encoding equations (X-29). The signal X 2 may be 

35 multiplied by a real constant times j to obtain a signal 
Y 2 and the signal W 2 may be multiplied by a real constant 
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times j to obtain a signal B2 suitable for use in a 
decoder for 2-channel encoded signals according to the 
invention having the form shown in figure 14. 

5 By way of example of a decoder according to the invention 
for a more general encoding system of the form (X-29) / 
consider the case where the encoding system signals are 
linear combinations of signals W2 and X2 with respective 
gains 

10 W 2 = 1, X 2 = x+jBy (X-31) 

which generalises the BMX case by having a real factor B 
not necessarily equal to ±1, although in most cases its 
magnitude will be fairly near 1 in value, for example in 
the range 0.7 to 1.4. 

15 

In this case, we may implement a decoder according to the 
invention for which 

P = til + t2 (x+jBy) 
v x = k 1 {-t3l+t 4 (x+jBy) } 
20 v y - k 1 {k 2 (y-jB- 1 x)+k 3 jl} (X-32) 

which are all complex linear combinations of W 2 and X 2 . 
In the case of decoders of the form of fig. 13 for the 
same rectangular loudspeaker layout of fig. 12 described 
. in the BMX case, this may be done by putting 
25 W d « til + t 2 (x+jBy) 

x d = ki(-t3l+t4 (x+jBy) }/cos<|> 
Y d = k 1 {k 2 (y-jB- 1 x)+k 3 jl) /sin<J> 

F d = 0, (X-33) 
which is a generalised version of equations (X13) of the 
30 BMX case, except that here we have chosen to normalise k_2 
and k 3 differently by taking out an additional factor ki; 
this is purely a matter of analytic convenience in the 
following. 

35 Prior art known decoders for such encoded signals that 
gave correct decoded azimuth 0, i.e. that had 
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Re[v x /P]:Re[v y /P] = x:y, < x " 34) 
all satisfied t 2 = 0, and hence had a pressure polar 
diagram that did not vary with frequency, and all 
satisfied k 2 = t 4 , and thus had, at each frequency, 
values of r v which were independent of encoded azimuth. 
However, as in the BMX case above, the present invention 
allows decoders to be designed substantially satisfying 
Eq. (X-34) for which the pressure polar diagram varies 
with frequency and for which r v varies significantly (by 
more than say 5% in value) with encoded signal direction. 



Computations using the above formulas then show that 

|X 2 ,2 = h ( i+b2) + 4(1-b2) ( x 2-y2) , (X-35) 
using the fact that X 2 +y 2 = i for all directions 9, and 
15 that 

Re[v x /P] = (Re[v x P*])/|P| 2 

Retv y /P] = (Re[v y P*])/IPI 2 . (X " 36) 

where _ 

,p,2 = ti 2 + t 2 2is(l+B2) + t^d-B 2 ) ( X 2-y2) (X-37) 
20 Re[v x P*]/k! = -t 3tl + t 2 t 4 H(l+B2) + t 2 t 4 H(l-B2) (x 2_ y 2) 

+ (t!t 4 -t 2 t3)x (X-38) 
RetVyP*]/^ = (k 2 ti+k3t 2 B)y. < x " 39) 
Unlike in the BMX case, making the constant term in Eq. 
(X-38) zero is not enough to make Re[v x /P) :Re[v y /P] 
25 proportional to x/y. However, the term 

t 2 t 4 *(l-B2) (x 2- y 2) 0 f E q. (X-38) is zero for azimuths 9 - 
±45» and +135", and typically causes only small reproduced 
azimuth errors for other azimuths (being worst around 9 = 
90°) since the coefficient t 2 t 4 k(l-B2) is generally small 
30 compared to the coefficient t]t 4 -t 2 t 3 of x. 

For this reason, we may ignore the small term 
t 2 t 4 *(l-B2) (x 2_ y 2) in E q. (X-38).. One then has 

reproduction of sound substantially in the correct 
35 velocity vector azimuth 9 if and only if 
Re[v x /P] :Re[vy/P] - x:y, 
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which is then the case from Eqs. (X-38) and (X-39) if 

-t 3 tx +t 2 t 4 k(l+B 2 ) = 0 (X-40) 

and 

(tit 4 -t 2 t 3 ) = (k 2 t 1 +k 3 t 2 B) . (X-41) 

5 

Eqs. (X-40) and (X-41) are in turn satisfied if we 
determine the normalisation factor )q by putting 

t 4 = t x (X-42) 
so that from Eq. (X-40), 
10 t 3 = H(l+B 2 )t 2 (X-43a) 

and substituting into Eq. (X-41) we find 

tx 2 - *s(l+B 2 )t 2 2 » k 2 ti+k 3 t 2 B. (X-43b) 

Thus Eqs. (X-42) and (X-43a) and (X43b) provide, for t 2 
15 not equal 0, a more general solution to decoding W 2 and 
X 2 than known in the prior art but sharing substantially 
the same decoded azimuths. At low frequencies, where it 
is desirable that ry = 1* the older known solutions with 
t 2 = 0 may be used, and at higher frequencies above a 
20 psychoacoustically determined cross-over frequency in the 
region typically of 400 Hz, the decoder, may use a nonzero 
value of t 2 giving a value of ry which varies with 
direction, preferably being chosen so as to be larger 
across the frontal stage of encoded directions than 
25 across the rear stage, as in the BMX example given above. 

We may rewrite equations (X-33) as 

W d = tiW 2 + t 2 X 2 

X d = kil-taV^+t^J/cos* 
30 Y d = k a {k 2 Y 2 +k 3 B 2 }/sin<t) 

F d = 0, (X-44) 
where W 2 , X 2 , Y 2 = -jB~ 1 X 2 , B 2 = jW 2 are of the general 
form described above for general decoders of figure 14 
for 2-channel directional encoding. 

35 
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Although the UHJ encoding . system does not strictly 
satisfy equations (X-31), it may be decoded in a similar 
way using 

W 2 = 0.982S + 0.l64jA 
5 X 2 = 0.4191 - 0.828jA 

where a value B = -1.085 approximately gives a 
satisfactory directional decoding. 

It will be seen that for any 2-channel decoder satisfying 
10 Eqs. (X-32) or Eqs. (X-28) that the pressure signal is of 
the general form 

P - a w l + bv?x + jcwy (X-45) 
and that the velocity signals v x and v y are of the 
general form 

15 v x = a x l + b x x + jc x y (X-46) 

Vy = -ja Y l - jbyx + cyy (X-47) 
where the coefficients aw, b^, cy, a x , bx# c x , ay, by and 
cy are all real. It is desirable for 2-channel 
directional encoding systems having the form indicated in 

20 Eqs. (X-29) that reproduced pressure and velocity gains 
be of the general form of Eqs. (X-45) to (X-47) if the 
results are to have a desirable left/right symmetry. 

In general , decoders for 2-channel encoded signals of the 
25 form of Eqs. (X-29) having larger ry and r^ across a 
frontal stage than across a rear stage will, as in the 
BMX example given earlier, have an undesirable gain 
variation with direction, with the less well localised 
rear stage sounds being reproduced with louder energy 
30 than the frontal stage. As in both the B- format and the 
BMX cases considered earlier, it is possible to subject 
the encoded 2-channel signals to a linear transformation 
which has very little effect on the encoding 
specification of directions except that the directions 
35 themselves are altered slightly and changed in gain. In 
the case of encoded signals W 2 , X 2 of the form of Eqs. 



WO 93/18630 



51 



PCT/GB93/00429 



(X-31), such a trans format ion produces transformed 
signals W 2 *, X 2 1 of the form 
W 2 ' = H(l+w)W 2 + H(l-w)X 2 

X 2 ' = H(l-w)W 2 + h(l+w)X 2 (x-48) 
5 similar to Eq. (X-20) for BMX, where w is a positive 
parameter, and where w < 1 if is desired to increase 
gains of sounds at the front and reduce them at the back. 

In a decoder, transformed signals w 2 f , X 2 * may replace W 2 
10 and X 2 , for example in Eqs. (X-44), whenever it is 
desired to adjust the reproduced directional gains. 
While this directional transformation may be implemented 
as a complex 2x2 matrix on the directional ly encoded 
signals before decoding, it is generally preferred if 
15 this matrix is either combined with the phase amplitude 
matrix in the decoder that derives the signals fed to the 
low and high frequency amplitude decoding matrices, or is 
implemented as a real linear matrix on signals such as 
W 2 , X 2 , Y 2 and B 2 . Such preferred implementations avoid 
20 having to use additional phase amplitude matrixing, which 
is generally more costly and harder to do well than 
simple amplitude matrixing, due to the use of and need 
for relative 90 degree phase difference networks. 
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The invention may be applied using the methods and 
principles described in more complicated cases than those 
decoders explicitly described in the examples. For 
example, the invention may be used with loudspeaker layouts 
5 having seven, eight, nine or more loudspeakers disposed in 
a left/right symmetrical arrangement around a listening 
area. The structure of such decoders is identical to that 
described with reference to Figures 10, 11 and 14, except 
that additional pairs of signals M, and S, are provided for 
10 feeding any left/right symmetric additional pairs L, and R, 
of speakers. 

Such layouts with further loudspeakers again 
preferably have a greater number of loudspeakers across the 
frontal reproduced stage than across the rear reproduced 
15 stage so that the reproduced value of r E is larger for 
frontal stage sounds than for rear stage sounds, and again 
it is preferred that the values of r v and r E as a function 
of encoded sound direction should roughly track each other. 
With such increased numbers of loudspeakers, the B- 
20 format enhancement signals E and F may as before be added 
to and subtracted from respective M f and S, signals so as 
to increase the separation among front stage loudspeakers 
and between front and rear stages. 

Such decoders are designed by exactly the same methods 
25 described in Appendix A and D in the 5 and 6 speaker case. 
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APPENDIX A 



SOLVING AMBISONIC PECO n TNG EQUATIONS 

We illustrate the method of finding the general 
5 left/right symmetric B-format decoder solution to equation 
(25) with reference to a decoder for the 5-speaker layout 
of Figure 7, assuming speaker feed signals of the form 
given by equations (26) with (27). A direct computation 
using equations (5), (6) , (8) and (9), yields from 
10 equations (26) 

P = C F + 2M F + 2Mb (28a) 
V x = Cp + 2MpCOS0p - 2MBCOSf% (28b) 



where, by a slight abuse of notation, we use the same 
20 symbols to represent the gains of signals for a given 
encoding azimuth 6 as we do to indicate the signals 
themselves. 

The quantities P, V x and V y of equation (28) are all 
left/right symmetric real linear combinations of W, X and 
25 Y. In particular, from equation (7), the requirement that 
6 V = 0 as in equation (25) implies that 

V x : V y = X r Y , (30a) 
30 so that we may put 



15 



V y = 2S F sinrfp + 2S B sin^B 



Vy - Z-opsmiup ' t^y-*"^jt> 

E = C F 2 + 2(M F 2 + S F 2 + Mb 2 + S B 2 ) 

= Cp 2 + 2(M F 2 +S F 2 )cosrf F - 2(Mb 2 +S b 2 )cos*% 
Ey = ^MpSpsintfp + 4M B S B sinrfg , 



(28c) 
(29a) 
(29b) 
(29c) 



V x = 2^5gX 
v y = 2*5gY 



(30b) 
(30c) 



35 where g is an overall gain factor. In order to simplify 
the equations following, we shall set 
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so as to avoid repeating a lot of factors g in the 
analysis; however, it will be necessary to multiply the 
overall decoder coefficients thus obtain in equations (27) 
by an overall gain g afterwards, in order to obtain a 
5 desired overall gain of reproduction. In particular, it is 
desirable to match the gains of low and high frequency 
Ambisonic decoding matrices so as to ensure a flat overall 
frequency response. 

Substituting equations (28) to (30) into equation 

10 (12) , we get 

Y[C F 2 + 2(M F 2 +S F 2 )cos|* F - 2(M B 2 +S B 2 )cosf%) 
= 4X[MpSpsinrfp + h%S B sin^] . 

15 substituting S F - k F Y and S B = k B Y into this from equations 
(27) , and dividing both sides by Y (which means discarding 
the exceptional solution Y - 0 to equations (25) , which 
only applies for azimuths 0° or 180°) , we get 



20 



Cp 2 + 2(M F 2 + k F 2 Y 2 )cp - 2(M B 2 +k B 2 Y 2 )c B 



= 4X[KpM F Sp + KbMbSbI ' (31) 

where to reduce notational clutter, we have written 

cp = cos<$p, sp = sintfp 
25 eg = cosj% and s B = sine^. (32) 



But from equation (2) , Y 2 - 2W 2 - X 2 , and from equations 
(28b) , (30b) and (30d) , 

Cp = 2^X - 2MpCp + 2h%C3 , (32b) 

so that substituting into equation (31) gives 

(X-2^HCp + 2^c B ) 2 + M F 2 C p - M B 2 CB ♦ (Kp 2 ^^) (2W 2 -X 2 ) 
= 2X(*rV F s ? * k B M B s B ) , 
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which, for an arbitrary real constant a, may be rewritten 
in the form 

[M F 2(2c F +l)c F ♦ Mb^Cb-Dcb - 4(^1^] 
. 2X[(k F s F +2 J >c F )M F + (^-2^)^] +«* 2 

= 2(k B 2cB.k F 2cF) W2 + ( * - l*k F 2c r -^ 2c B> X ' ^ 

For a suitable choice of a, to be determined, the left hand 

side of equation (33) can be f actor ised, and the right hand 
10 side of equation (33) can also be factorised provided only 

that the coefficients of W 2 and X 2 are of opposite signs. 

By setting factors on the two sides equal. to each other, we 

find solutions to the decoding equations (25) which are of 

the form given by equations (27). 
15 The first step in factorising the left hand side of 

equation (33) is to f actor ise the first term in square 

brackets, i.e. to write it in the form 

(* F M F + « b Mb) (^pMjp + ft B H B ) , ( 34 ) 

where for convenience ve choose to put 
20 oc F =/3 F =^[(2 Cr H)cp] , (35) 

which is real provided that |rf F |< 90° . Putting 

a = ep(2cp+l) , b = 20^03 and c = 03(203-!) (36) 

we find by solving a quadratic equation that the 
25 factorisation equation (34) equals the first square bracket 
term on the left hand side of equation (33) provided that 

* F */l F =ya < 37a) 
a B = [-b-V<b2-ac)]/Va (37b) 
te = [-b + v / (b2-ac))/^a , < 37c) 

which are real, so that a factorisation exists, only if b 2 
> ac. 

The left hand side of equation (33) can thus be 
written in the factorisable form 



30 



35 
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if we have 

-A ^ (39a) 

«X* + *X*F ■ -2Ccp-P*«*> (39b) 

"X&B + <Vb = - 2(k B s B" 2 ' c B ) 

5 

and we put 

(39c) 

'xftc • 

Given k F and k„. one can solve the linear equations (39a) 
10 and (39b) in a x and p %l giving: 

[-2dc F s F ^CF)>B - [~^BSg^fB)H; (40a) 

[-2(k F s F ^C F )]p B - l-H^W 2 ^** ^ 

15 Px = ^H-^Pf 

from which a can be computed via equation (39c). 

Thus, we can factorise both sides of equation (33) and 
write: 

20 («E«F + «#<B ♦«XX"PF*«F * PB^B + Px*> 

= (-C)IYX ♦ Wl (-C)-l(sgnD[rX ¥ SW] 

where 

25 

where 

r = * X &X - 1 + k F 2 CF " k B 2c B 
and 

30 * =^|20c f 2cf-1Cb 2 Cb>I < 42b > 

provided only that the coefficients of W 2 and X 2 on the 
right hand side of equation (33) do not have the same sign, 
where C is an arbitrary nonzero coefficient that can be 
35 chosen freely. 
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If we select a value of k F , then the value of k fl can 
be computed from equations (28c), (30c) and (27) to be 
given by 

(43) 

lc B = (2-5 - k F s F )/s B • 

Thus, specifying a chosen value of k F and C, and a choice 
of the ± signs in equation (41) allows us to put. 

Pf H F * AM ♦ w - (-o-WHrx ? *»]. < 44b) 

10 and 

apMf + +« X X = (-C)trX * i«l (44a) 

and equations (44) thus form a pair of simultaneous linear 
equations in M f and whose solution expresses M F and M, 
15 in the form of equations (27) . Having solved equation (44) 
and M f and M.. equation (32b) can then be used to express 
C. in the form of equation (27). Thus, given an arbitrary 
choice of the coefficients k, and C, a choice of the sign 
+ in equation (44) , this completely solves the problem of 
20 finding a B-format solution of the form equation (27) to 
equation (25) , provided only that the coefficients of W 
and X 2 in equation (33) (which depend only on the choice of 
k F ) do not have the same sign. 

We have implemented a numerical program to determine 
solutions to the B-format Ambisonic decoding equations (25) 
to (27) for 5-speaker decoders using the above solution 
algorithm, with input user parameters k F , C and the sign ±. 
It has been found that the behaviour of the resulting 
solutions behaves in quite a singular way particularly as 
k F varies near the values for which the coefficients of W 
or X 2 in equation (33) become equal to zero. It turns out 
that subjectively desirable solutions tend to be quite 
close to these singularity values, so that a first step in 
finding solutions is to determine what values of k F cause 
either coefficient of W 2 or X 2 in equation (33) to equal 
zero, and to ensure that either k F exceeds the larger such 



25 
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value or is smaller than the smaller such value in order 
that the signs of the W 2 and X 2 coefficients differ. 

Exploring the values of r v and r E of different 
solutions, it has been found that the sign of ± in 
5 equations (44) should be chosen to be the upper sign, that 
k F should exceed the largest "critical value" for which one 
.of the coefficients of W 2 and X 2 in equation (33) equals 
zero, and that C should be positive, typically between 0.5 
and 2. 

10 A low frequency solution with r v = 1 in all directions 

may be found most easily by noting that P has the form 
P = a P W + bpX < 45 > 

computed via equation (28a), and that r v = 1 for all 
15 directions if an only if 

(45b) 

ap = 2 and bp = 0 . 

Thus a low frequency solution can be found by varying k F 
20 and C until equations (45b) are found to be satisfied; such 
values can be found by a numerical "hill climbinq" or 
Newton's algorithm method. We have found that generally, 
there are to such r v = 1 solutions within the chosen 
desirable range of parameters k F , C and t, and that the one 
25 with larger k F generally gives larger values of r E , and so 
is more desirable. 

As explained earlier, finding a high frequency 
solution maximising r E is a more subjective thing, since r E 
cannot simultaneously be maximised in all directions. 
30 However, it has been found that typically, excellent 
results are obtained by choosing values of and C in the 
desirable range of values such that a p approximately equals 

. 145c) 
a P = 8^ , 

35 which gives r v = 0.7071 for azimuths 0 « ± 90*. The choice 
of b p is less clear, but in general, b p at high frequencies 
is preferably chosen to be a negative coefficient such that 
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for 6 = 0°, the outputs from the Lg and speakers are 
close to or equal to zero, and at least 20 dB below the 
outputs from the frontal loudspeakers. 

In doing designs of Ambisonic decoders for any given 
5 layout shape, (i.e. given <p f and 0 B ) , the values of k F and 
C are varied and for each such choice of values, it is 
desirable to compute a p and b pf and also the coefficients 
in equations (27), and additionally to compute the speaker 
feed gains, the energy gain E (in decibels) , and the values 

10 of the psychoacoustic localisation parameters r v , 0 V , r E , 
and 0 E for each encoded azimuth 0 (selecting perhaps 
typical values say 6 = 0\ 15% 45*, 60% 90% 135- and 
180° .- there is no need to examine negative azimuths, since 
the results are left/right symmetrical). One should, of 

15 course, have 0 V « 0 E = *, so such computations provide a 
useful check that the above algorithms have been computed 
correctly. 

It is then possible to see how r E in particular varies 
with azimuth so as to select a good choice at high 

20 frequencies. However, satisfying equation (45c) and 
ensuring that L, = R, = 0 for 6 - 0° provides a reasonable 
"automated" choice of high-frequency decoder. As with the 
r v = 1 low frequency solution, however, there are generally 
two such solutions, and the one with larger k F is generally 

25 found to have better r v and r E performance. 

By way of example, in Table 1, we show the computed 
results of an analysis and decoder design for the case <p f 
= 4 5° and <p B = 50°, both for the low frequency r v = 1 
solution and the high frequency solution satisfying 

30 equation (45c) and having rear-speaker outputs equal to 
zero for 6 = 0. It will be noted that r E is larger at the 
front than at the back, and is usefully larger over a 
frontal stage than the typical value r E = 0.7071 
encountered for prior art Ambisonic B-format decoders. 

35 However, this example also illustrates a typical defect 
encountered with 5-speaker and 6-speaker decoders designed 
according to the methods- herein - namely that those 
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directions for which r E is largest (and for which high 
frequency localisation is best) are reproduced with the 
lowest gain and those for which r E is smallest (and for 
which localisation is poorest) are reproduced with the 
5 highest gain. This is clearly undesirable. 

in order to overcome this problem, it is necessary to 
use forward dominance to help reduce the gain of back 
sounds. Typically, the degree of forward dominance applied 
will be that which compensates for the difference in total 
10 energy gain between due front and due back sounds, at high 
frequencies, thereby giving equal gains in the front and 
back stages. The price paid for using forward dominance to 
compensate for gain variations in the decoder is that the 
reproduced azimuth 8 V = « E no longer equals the encoded 
15 azimuth 6, but a modified azimuth %• given by equations 
(4) . For forward dominance, this generally results in a 
narrower reproduced frontal stage. This is often 
desirable, since it helps to narrow the rather wide frontal 
Ambisonic stage to be a better match to the rather narrower 
20 frontal stage encountered with stereo reproduction systems 
using n F frontal stage channels and n B rear stage channels, 
and helps improve the match between the directions of 
sounds and associated visual images with HDTV. 

in general, such additional forward dominance need not 
25 be implemented as a separate pre-decoder adjustment as 
shown in Figure 10 (although such additional adjustment can 
be a useful listener control), but may preferably be 
implemented as altered coefficients a c , b c , a f , b F , a„ and b,, 
in the decoder matrices implementing equations (27) - there 
30 is no need to alter the Y coefficients since these are 
unaffected by forward dominance adjustments. The modified 
coefficients a c '. b c ' , a f ', b f ', a.' and V may be derived 
from the computed coefficients a c to b, as follows: First 
compute the values (46a) 
35 a c +2V . a c - 2^ , ( 

then multiply them respectively by X and X , giving ^ 
c . = Mac^bc • < c" = (a c -2 ! !bc)/\ 
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and finally compute the modified coefficients 
ac' = ^(c'+c M ) 
be' = (c'-c")/^ . 

5 Identical computations are used to compute the other 

coefficients, simply by replacing the subscripts C in 
equation (46) either by F throughout or by B throughout. 

In addition to using forward dominance with gain X 2 to 
compensate for the difference between front and rear gain 

10 at high frequencies, it is also desirable to adjust the 
overall gain of (say) the high frequency decoder to match 
that of the low frequency decoder. Since in general the 
way gain varies with encoded azimuth will not be identical 
at low and high frequencies, in practice it is necessary to 

15 choose a particular azimuth (say 0 V = 45°) at which to make 
the gains of the low and high frequency decoders identical. 
Such an application of dominance and gain adjustment 
finishes the design procedure, and it is only necessary to 
check that for the average of the low and high frequency 

20 coefficients in equations (27) , that the computed values of 
0 V and 0 E do not deviate markedly from their values at low 
and high frequencies, to ensure that the decoder of Figure 
10 continues to perform well in the cross-over frequency 
range. It is found that 0 V does not vary in the cross-over 

25 range thanks to equations (30b) and (30c), and that 0 E 
differs from 0 V in that frequency range only by an 
insignificant fraction of a degree. 

Very similar design methods are used for decoders for 
B-format decoders for six speakers, with a similar use of 

30 factorisation of two sides of an equation similar to 
equation (33). The main difference is that there is an 
additional free parameter k c (see equations (27)) in 
addition to k F and C, so that optimisation of decoder 
designs (including the r v = 1 low frequency case and the a p 

35 = Sh high frequency case) involves trying to maximise r E 
over a wider range of design parameters, and in doing such 
designs it helps to have interactive computing facilities 



(46c) 
(46d) 
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so that the way decoder performance alters as parameter 
values change can be examined interactively. 
Alternatively, by putting k c = 0, similar design methods to 
those used in the 5-speaker case can be used, with only 
relatively small changes in formulas from equations (28) 
onward. However, a 6-speaker decoder design with ^ - 0 
will not necessarily give the best possible performance. 
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APPENDIX B 

SPECIAL SOLUTIONS 

The complexity of the analytic solutions for general 
5 speaker layouts motivates a search for Ambisonic decoder 
solutions that are analytically simple, for 5 or 6 
speakers. Such simple solutions exist for the special 
case, illustrated in Figures 6 and 8, for which the Lg, L F , 
R f and Rg loudspeakers lie on a rectangle. These special 
10 cases are of considerable interest in their own right. 
Here we give the results we have found, derived by methods 
involving factorisation similar to those used in the last 
section. We omit full details of the derivations of these 
results. 

15 We introduce, for rectangle speaker layouts, the 

notations 

W 0 = *1(LB + Lp + Rp + R B ) 
Xq = *$(-L B + L F + R F - R B ) 
Y 0 = + W ~ R F - R B> 

20 f 0 = + W - *f + Rb> ■ (47) 



The matrix equation (47) is a 4 x 4 orthogonal matrixing, 
and its inverse is 

25 Lb = *s(w 0 - X 0 + Y 0 - F 0 ) 
Lp = hW 0 + Xq ♦ Y 0 + F 0 ) 
R F = ^(W 0 + Xq - Y 0 - F 0 ) 

RB ■ W«b - *0 - Y 0 + F 0 ) - (48) 

30 If we require that equations (30) hold for decoders for 
these layouts, it is found that the following solutions 
exist to the Ambisonic decoding equations: 



35 



7.1 4-Speaker Decoder 

This solution has 
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C F = Sc = 0 (giving no output from frontal speakers) 
F 0 = 0 

Xq = X/(2*cosrf) (49) 
Yn = YA^sinrf), ^ . 

where preferred decoders generally have k 2 = 0, and where 

M = 1 (49b) 



10 



15 



25 



30 



for the low frequency r v - 1 solution, and 

(49c) 

for the high frequency solution. This solution is the 
well-known Ambisonic decoder for a rectangular speaker 
layout described in the prior art Ambisonic literature. 



7.2 c r = w 9 solu tion 

For both 5- and 6-speaker layouts, this solution is 

characterised by the equations 
F 0 = S C = 0 

20 C F * w 0 j (50a) 

Y Q = Y/U^sinrf) 

and, for the 5-speaker case 

Xq = [2^X - C F ]/(2cosi*) (5 ° b) 
or for the 6-speaker case 

Xq * [2^X - 2cos J zS c C F ]/(2cos?S) . 
For the 5-speaker case, put 



(50c) 



and for the 6-speaker case, put 

w 0 = ^( kl w + K 2 X) < (5<M) 
where in both cases, the r v = 1 low frequency solution has 
35 tcj = 1, k 2 = 0 ' (50e> 
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10 



15 



This solution generally has very bad r E for rear azimuths, 
so is not generally recommended. The "best" values of k, 
and k 2 at high frequencies do not give such a good r £ even 
for azimuth 0° sounds as solutions described below, and 
considerably worse r E at the rear. 

7.3 Forward- and Backward oriented Solutions 
7.3.1 5-Bpeaker Case 

This satisfies the equations 

Xo = (2^X- C F )/(2cosirf) 

F 0 = kY/^sinrf) , (51) 



where k is a free parameter and C a nonzero free 
20 parameter, and the f orward-or ^nted solution is that with 
the upper choice of signs in equations (51) and the 
backward-oriented solution is that with the lower choice of 
signs. The forward-oriented solution is found to be 
subjectively far more satisfactory than the backward- 
25 oriented solution or the C F = W 0 solution, and superior to 
the 4-speaker solution across a broad frontal state of 
azimuths. 

The low frequency r v = 1 solution can be shown to be 

given by the formulas 

1 



30 k = 



sin 2 ** ** sin 2 rf 



C = (1±K)/V8 . (51b) 
The value of our earlier parameters k F and k B is given in 
35 this case by 

k F = «l*k)/(2*iinrf) (51c) 
^ , ! s (l-k)/(2 ? 5sir.«5^ - 
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7.3.2. Case 

This special case satisfies 

2%cos«S 

Xq = (2'"5X - 2cos«ScC F )/(2cosizS) 

Y 0 = Y/(2 ! 5siiM{) (52) 
F 0 = kY/(2 ! Ssin85) , 



15 where k and C are free parameters, with C * 0, and the 

. ,_^ on + 0 H solution is that with the upper choice of 

signs in equations (52) , and the tnrVwi.rrt-oHm.ted solution 
is that with the lower choice of signs. As in the 5- 
speaker case, the forward-oriented solution is found to be 
subjectively far more satisfactory than the backward 
oriented solution of the C F = W 0 solution, and superior to 
the 4-speaker solution across a broad frontal stage of 
azimuths . 

The low frequency r v = 1 solution can be shown to be 
25 given by the formulas 



20 



30 



35 



cos^csin 2 «5 cosjJc sin^j) 

C- = isd+K) (cosdic)^ . < 52b) 
The rectangle cases dealt with in this Appendix, with 
are rather special in that the 4-speaker and C f 
= W 0 solutions arise in the special case that the 
coefficients of both W 2 and X 2 in equation (33) (or its 6- 
speaker equivalent) equal zero. There is no counterpart to 
these special solutions in the non-rectangular case. 
However, the solutions considered in subsection 7.3 are 
special cases of the general solutions discussed in section 
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6, distinguished only by virtue of the relative simplicity 
of the form of the solution. 

The fact that there are a number of quite distinct 
families of solutions for Ambisonic decoders for specific 
5 speaker layouts seems to be quite a general phenomenon. 
For example, there are two distinct solutions to panpot 
laws for three speaker stereo such that 0 V = B t . For 
speaker layouts even more elaborate than the five or six 
speaker layouts considered here, the structure of the space 
10 of solutions to the Ambisonic decoding equations can be 
quite complex, and a computer search among the solutions is 
required to identify those having the best r E behaviour. 
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ftPPENDIX C 



wnMB pTPRL RESULTS 

Table 2 lists a range of low and high frequency 
5 designs for 9 different 5-speaker layouts computed by the 
me thods of appendices A and B, including forward dominance 
to compensate for front/rear gain variations and a gam 
adjustment of the high frequency decoder to ensure that it 
has the same gain at reproduced azimuths ± 45- as the low 
10 frequency decoder. The cases o t = 35% 45- and 55- and C B - 
9 = 5', 0- and -10- are listed. Because both the low and 
high frequency decoder matrices are chosen according to 
"objective" criteria, it is possible to use quadratic 
interpolation to derive 5-speaker Ambisonic decoders for 
15 other intermediate values of © F and o B - 9 f . 

It has been found, however, that low frequency r v = 1 
solutions do not exist for all angles 0 F , V For example, 
for the values o F = 35% * B = 55% there is no r v - 1 
solution. in general, such low frequency solutions are 
20 found to exist for 0 B < •,. but in general, * B cannot be 
more than about 10% or 15% larger than 9 t before an r v - 1 
solution can no longer be found. In cases where the r v - 
l solution does not exist, one should seek to use a low 
frequency solution having as large a value of r v as is 
25 possible if this still gives a greater r v than at high 
frequencies* 

The attainable value of r E at high frequencies for 
front stage sound is clearly enhanced by the use of the 5- 
speaker decoder, as seen in Table 2, as compared to similar 

30 4-speaker rectangle decoders, thanks to the significant 
output from the C f speakers, with r E typically being 
increased from 0.7071 for a square layout to around 0.835 
when a C F speaker is added. This almost halves the degree 
of image movement for front stage sounds. It will be seen 

35 that the value of r E at the sides and back is not 
drastically reduced, although the average value for r E over 
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the whole 360' stage is not increased, and in fact is 

"^"gh the value of r e at the front is not 
brought up to ideal values very close to 1, the use of a 5- 
spea'er Lisonic decoder provides an improved xmage 
stability, as compared to previous designs, without gxvxng 
n unacce table loss of the rest of the surround sound 3 
' "age. Thus a 5-speaker Ambisonic decoder desxgned as here 
bribed etches TV use a great deal better than earlxer 
decoders, and ma*es good use of just three transx.xss.on 
channels although there is still a need for enhancxng 
fronlstage results by adding extra transmission channel 

Si9na T he use of six speakers gives a further improvement of 
r E at the front, improving image stability further, whxle 
still giving reasonable values or r E (typically around 0.6) 
around the rest of the sound stage, as the 
aecoder solutions listed in Table 3 illustrate. The degre 
of frontal stage image movement of the 6-speaKer decoder s 
tyP ically only 40% of that encountered wxth -speaKer 
Coders. Where possible, the use of sxx speakers 1S 
preferable to five in terms of frontal xmage stabxlxty. 
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APPENDIX D 



SIX— SPEAKER B— FORMAT SOLUTIONS TO AMBISONIC DECODING 
EQUATIONS 

5 Here we outline the general solution to the 6 -speaker 

Ambisonic decoding eguations for B-format signals for the 
speaker layout of Figure 9, analogous to the 5-speaker 
solution given in Appendix A. Except where explicitly 
defined otherwise, all notations are those of the above 
10 text and will not be redefined here. 

Use the notations c c = coso c and s c = sino c and t c = 
tano c . 

For the 6-speaker decoder, we assume that we seek 
solutions of the form of equations (26) and (27) , where we 
15 assume that we start the design by assuming values of the 
parameters k c and k F . Then the quantities P, V x , V yf E, E x , 
E y are given by 
P = 2(C F + Hp + Mg) 
V x = 2(ccC F + CpMp - cbMb) 
20 V y = 2(scS c + S F S F + s B S B ) = 2(s c ke + s F k F + s B K B ) Y 

E = 2(C F 2 + S c 2 + Hp 2 + S F 2 + Mb 2 + S B 2 ) 

= 2(C F 2 + M F 2 + f-% 2 ) + 2(kc 2 + kp 2 + k B 2 ) Y 2 
Ex = 2(C F 2 + S c 2 )cc + 2(M F 2 + S F 2 )cp - 2(h% 2 + S b 2 )cb =r 
Ey = 4(s c C F Sc + SpWfSp + s B M B S B ) 
25 = 4(s c kcC F + SfkfWp + S B k^B) Y . (Al) 



30 



Putting by analogy with equations (30) 



V x = 2h X 



V y =2*Y < A2 > 

and ensuring that 0 V = 6 Z = 0 by setting 
ExV y = EyV x , (A3) 

we get from equations (Al) by dividing by Y (and hence 
35 ignoring the very special solutions with ¥ = 0) 
Cp 2 c c + M F 2 cp - M B 2 c c + 0c c 2 c c + k F 2 cp - *B 2c B> y2 
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(A6) 



7J 

From equation (A2) and (Al) , we have 

C F = (2-*5X - cpMp + cbMb)/cc 
and also from equation (2) that 
5 Y 2 = ?w2 - X2 . 

Substituting these into equation (A4) , we get 
(2~h X - CpM F + CbMb) 2 /c c + cpMp 2 - CbMb 2 

+ (Kc 2 cc + *F 2c F - Kb 2 ^)* 2 * 2 - x2) 
10 = 2[(2~ i 5X - CpW F + CgMaJkctc + spkpWp + S^hfe) X . (A7) 

Note that from equations (Al) and (A2) , particularly the 
equations for V y , that k B is given in terms of k c and k F by 

15 Kb -(2-% - s c Kc - s F K F )/s B . < A8 > 
Then equation (A7) can be rearranged to give 

+ <*X 2 

= AW?+ rx 2 (A9) 



25 where a is an arbitrary constant to be chosen so that the 
left hand side of equation (A9) f actor ises, and where 

A = 2(k B 2cB - kc 2 cc - K f 2cf) (AlOa) 
and 

3 0 r = ^ _ _L + + K^cp _ ^Cfi + 2^t c • (AlOb) 

C C 

The first square-bracketed term on the left hand side of 
equation (A9) can be factorised in the form 

(« F M F +*tf< B >(PF*F + fr**B ) 



where 
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« B * [_ b -v'CbZ-ac) Wa < Allb > 
3 B = [-b+>/(b2-ac)]/ya , < Allc > 

where a, b and c are defined as the three quadratic 
5 coefficients in the first square-bracketed term of the left 
hand side of equation (A9), i.e. 
a = cpU + cp/c c ) 

b = cfCb/cc (Alld) 
c = (-1 + c%/c c )cb ' 
10 Then the left hand side of equation (A9) can be factorised 

in the form 

(«B*% +«#B + *xX>(Pe«F + / 3 B M B + P& (A12) 
provided that a is chosen to equal 

15 « x p x (A13a) 

and that a x and 0 X are the solutions to the pair of linear 
simultaneous equations 

20 -a = | - + 2cpKctc (M3b) 

and 

*X*B + Px*B = Z^Cb/Cc) - 2s B k B - 2cBkctc . (Al3c) 

Having used equation (A13) to compute a x ,0 x and a, one can 
25 then compute the numerical values of T and A. 

In the very special case that k c and k F are such that 
r = A = 0, then we have that either 

*F*F + *bMb + *x* = 0 (A14a) 

30 or 

Pf"f * PbMb + Px* - 0 • (A14b) 
and either of the conditions (A14a) or (Al4b) is sufficient 
in that special case to ensure that the resulting decoder 
3 5 satisfies the 6 = 6 V = 6 E equations. Otherwise, it is 
necessary to choose values of k c and k F such that r and A 
do not have the same sign. 
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in that case, putting Y = V | r| and i = V / i|, 
equation (A9) can be put in the form 

♦ «#B + ^XXft«F * ^ *X*> (M5) 
= (yxi %W)(sgnD(YX 

so that, for an arbitrary nonzero constant C, we can 
separate factors and put, for arbitrary choice of the sign 

- (-O(VXiSW) CUSb) 
and 

- WW*"" W« 

Thus equations (A15b) and (A15c) are a pair of simultaneous 
15 linear equations for M F and M, in terms of W and X, once one 
has chosen the constants 

(Alo) 

kp, C and + . 

C f can be then derived from equation (A5) , and S e , S f and S B 
20 are given via equation (27), where k„ is given by equation 
(A8) . 

This completes the derivation of a solution of the 6- 
speaker decoding equations for 6 = 6 V = 6 E for a given 
speaker layout. In general, the best solutions, in terms 

25 of giving reasonable values of r„ are again those with ± 
= + and C positive, but a search among the possible values 
of the parameters k< and k f is required. There is a one- 
parameter family of solutions which have r v = 1, and one 
will generally choose those solutions giving largest r e at 

30 low frequencies. In general, when searching among the 
parameters (A16) for a good high frequency solution, 
similar methods to those described in section 6 are used, 
and it is convenient to search among the values of the 
extra parameter k c in the 6-speaker case by setting 

35 k c = Kk F 
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where the constant K will generally be chosen to be 
positive and typically having a value in the general 
neighbourhood of o c /o F . 

The need to search among the values of 3 continuous 
5 parameters (A16) for a good 6-speaker solution means that 
there is more choice in finding suitable equations for low 
and high frequency decoders for a given 6-speaker layout 
(provided that the layout is such as to give an r v = 1 
solution), and the designer of a 6-speaker ambisonic 

10 decoder thus has some leeway in designing different 
tradeoffs for different tastes. In making such design 
tradeoffs, it is advisable to write a computer program that 
not only computes the decoder equations for a different 
values of the free parameters (A16) , but which also prints 

15 out the values of the psychoacoustic localisation 
parameters r v and r E for azimuths around the circle, 
possibly in a graphical form, so that the effect of varying 
the decoder parameters (A16) can be seen in judging the 
final best tradeoffs. 

20 For any given speaker layout, once a decoder design is 

arrived at, the forward dominance should be adjusted to 
minimise front/back reproduced gain variations, especially 
at higher frequencies, and the relative gain of the low and 
high frequency decoders should be adjusted so as to give 

25 broadly similar reproduced gain at all frequencies for at 
least front-stage sounds, as already described in 
connection with 5-speaker B-format decoders. This will 
yield the final 6-speaker B-format ambisonic decoder 
equations which typically may be implemented as in Figure 

30 10. 

Such design procedures need to be done for a range of 
layout angles e> c , o F and o B in Figure 9 likely to be used, 
so that the Ambisonic decoder can be adapted to the layout 
actually used in any particular situation, and adjustment 
35 means for the decoder matrices in Figure 10 to allow this 
are desirably included. 
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In general, it is found that, for a given set of 
positions for the Lg, L F , R F , Rj loudspeakers, a 6-speaker 
design will give a significantly larger r E across the 
frontal stage than a 5-speaker design, with only a small 
5 reduction of r e , spread across the rear and side stages, in 
other directions. Thus, in general, 6-speaker designs are 
often better than 5-speaker designs in their subjective 
performance. The price paid for this improved localisation 
quality performance is the need to use larger amounts of 

10 forward dominance in 6-speaker designs (typically over 6 
dB) to compensate- reproduced gain variations than is needed 
for 5-speaker designs. 

It is thought that this trend of improved frontal r E 
and the need for yet more forward dominance applies even 

15 more in the case of 7- or 8- speaker Ambisonic designs with 
yet more front stage speakers. For such designs, the 
number of free continuous parameters increases (5 for the 
7 speaker case and € for the 8-speaker case) , so that 
surveying the possible solutions to choose a "best" 

20 tradeoff becomes very time consuming, and preferably 
requires computer graphic aids to present psychoacoustic 
performance data in an easily assimable form. 
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tow frequency d~oder design 5-speakers, * = <% - 5CP 
k F = 0.50527 f C * 1.13949 

Cp = 0.34190 W ♦ 0.23322 X, Hp = 0.26813 W ♦ 0.38191 X, S F = 0.50527 Y 

Mb * 0.56092 W - 0.49852 X, S B = 0.45666 Y 
decoder performance: 

©=©V=0r r V dB r E Is L F C F *F R B 

60 1.0000 3.6* 0.6,81 0.7677 1.1570 0.5068 -0 0806 -0.3509 

Hioh frequency ***** aesicm 5 speakers. «J F = 4*>. A * 500 
k F = 0.54094 , C = 0.93050 

4» 0.38324 W 0.37228X, * - 0.44022 W ♦ 0.23386X, Sp = 0.54094 Y 
Mb = 0.78238 W - 0.55322 X. 5* = 0.42374 Y 

decoder performance: ' 
frtfv^B r v dB r E ^ I* C F Rf "B 

0 0.8158 3 046 0 8273 0 0.7709 0 9097 0 7709 ^ 

S S:5S :S S:SS !:S ?:K o.7 555 0.1331 -o.»«. 

60 0.7576 4.585 0.7059 0.9102 1.2681 0 6465 -0 0569 -0.1278 
90 0.7071 5.620 0.6550 .3816 1.2052 u 0.9919 

S ::S£ i:S iSt J:S KK -0.1432 0.1095 

forward dc^nce^ before aj-y^gJ^J S - 0^ - 4 5 o by 
r£5S dB fain S-XiSTC hi* f regency decoder. 
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TABLES 2 AND 3 

Low frequencies = 35° , = 25° 

K F = 0.73675, C = 0.90593, forvard dominance = 3.8636 dB, gain = 0 dB 
C F = 0.44310W * 0.45887X, Mp = 0.42349W ♦ 0.24979X, S F = 0.73675 Y, 
Mg = 0. 37979 W - 0.32066X, S B = 0.67324 Y . 
High frequencies ffp = 35°, 6% s 25 ° 

K F = 0.74762, C = 0.80803, forvard dominance = 3.8636 dB, gain = -0.4217 dB 

C F = 0.49752 W 4 0.43912X, Mp = 0.54081 W 4 0.16195X, Sp = 0.71219 Y, 

M B = 0.52758 W - 0.37306X, S B = 0.62728 Y . 

psychoacoustic analysis 

lov frequencies high frequencies 

0 ©v=Ge .ry r E dB r v r E dB 

0 O.00 1.0000 0.9009 3.820 0.8952 0.9120 3.868 
15 12.03 1.0000 0.8319 4.130 0.8900 0.85 2 4.16? 
45 36.69 1.0000 0.5424 5.705 0.8504 0.5912 5.699 
60 49.61 1.0000 0.4399 6.379 0.8186 0.4988 6.388 
90 77.36 1.0000 0.3447 6.837 0.7412 0.4190 6.983 

135 125.29 1.0000 0.4405 4.820 0.6306 0.5192 5.699 

180 180.00 1.0000 0.8384 1.586 0.5843 0.7567 3.868 

Table 2a 5-speaker Ambisonic decoder design for rf F = 35°, u% = 25°. 
t>ov frequencies f* F = 35°, 6% = 35° 

k F = 0.63701, C = 1.00206, forvard dominance = 4.1113 dB, gain = 0 dB 
Cp = 0.42521 W + 0.34037X, Mp = 0.42380W ♦ 0.33924X, Sp = 0.63701 Y, 
Mg = 0.39173 W - 0.34051 X, S B = 0.59579 Y. 
High frequencies rfp = 35°, - 35° 

k F = 0.65584, C = 0.83708, forvard dominance = 4.1113 dB, gain = -0.5188 dB 
Cp = 0.46290W ♦ 0.37054X, Mp = 0.53317W ♦ 0.22733X, S F = 0.61782Y, 
Mg = 0.54101 W - 0.38255X, S B = 0.54351 Y. 

psychoacoustic analysis: 

low frequencies high frequencies 

0 ©v=©£ r v r E dB r V r E dB 

0 0.00 1.0000 0.8686 3.928 0.8853 0.8915 3.865 

15 11.86 1.0000 0.8251 4.118 0.8806 0.8527 4.057 

45 36.21 1.0000 0.6122 5.155 0.8443 0.6633 5.127 

60 49.00 1.0000 0.5227 5.626 0.8H7 0.5847 5.642 

90 76.56 1.0000 0.4317 5.899 0.7418 0.5.102 6. 03 

135 124.62 1.0000 0.5225 4.175 0.6345 0.5916 5.127 

180 180.00 1.0000 0.8087 1.860 0.5886 0.7561 3.865 

Table 2b 5-speaker Ambisonic decoder design for s5 F = 35°, <zb = 35°. 
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Lov frequencies rfp = 35°, <% = 40° . „ .« 

kF - 0.59675. C = 1.20337. forward dominance - 4.2769 dB. gain . = 0 dB 
4 = 0.44167W ♦ 0.22867X. M F = 0.40882 W + 0.41726X. S F = 0.59675Y, 
MB = 0.40080W - 0.35574X, S B = 0.56756 Y. 

High frequencies d F = 35°, = 40 ° . « e^t* 

J = 0.62175, C . 0.86543, forward dominance . 4.2769 dB, gam - -0.5,64 dB 
C F . 0.44995 H * 0.33464X, M F - 0.52657 W + 0.26606X, S F = 0.58384 Y, 
Mq = 0.55191 W - 0.39026X, S B = 0.51202 Y. 
psychoacoustic analysis: 

low frequencies^ _h«h ^equencies^ 



dB r v r E 



0 0.00 1.0000 0.847! 4.153 0.8803 0.8812 3.949 
15 11.75 1.0000 0.8150 '.283 0.8758 0.8511 4 098 
35.89 1.0000 0-6431 5.017 0.8412 0.6946 



60 46.58 1.0000 0.5619 5.358 0.8129 0.6244 5 384 

90 76.03 1.0000 0.4681 5.519 0.7424 0.5523 5 773 

135 124.17 1.0000 0.5185 4.060 0.6367 0.6136 «.wo 

80 180.00 1.0000 0.6908 2.339 0.5908 0.7368 3.949 



124.17 1.0000 0.5185 4.060 0.6367 0.6136 
180.00 1.0000 0.6908 2.339 0.5908 0.7368 

Table 2c 5-speaker Antoisonic decoder design for «S F = 35°. cfe = 40 °- 



Lov frequencies 6? - 45°, ^b = 35° 

k P = 0.58727, C = 0.91902. forward dominance = 3.1169 dB, gain = 0 dB 
C F = 0.41964W ♦ 0.45001 X, H F = 0.41347W * 0.27104X, S F = 0.58727 Y. 
Mb = 0.39285 W - 0.36850X, S B = 0.50882 Y. 
High frequencies = 45°, 6b - 35° 

* p = 0.60353. C = 0.85140. forward dominance = 3.1169 dB, gam - -0.7244 dB 
C F = 0.48102W + 0.43822X. Mp = 0.51317 H ♦ 0.18754X. S F = 0.55524 Y, 
Mb = 0.53400 W - 0.37759X, S B = 0.44965 Y. 
psychoacoustic analysis: 

low frequencies high frequencies 

6 0v-9e r v r E dB r v r E dB 

0 0.00 1.0000 0.8214 3.834 0.8284 0.8535 3.844 
15 12.56 1.0000 0.7946 3.953 0.8250 0.8306 -.957 
45 1.0000 0.6495 4.627 0.7991 0.7045 4.619 

60 51.52 1.0000 0.5809 4.946 0.7760 0.6439 4 960 
90 79.77 1.0000 0.5077 5.108 0.7260 0.5785 5.277 
135 127.27 1.0000 0.5872 3.817 0.6495 0.6300 4.619 
180 180.00 1.0000 0.7678 2.354 0.6168 0.7273 3.844 

Table 2 i 5-speaker Ambisonic decoder design for S$r = 45°. ffc = 
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Low frequencies rfp = = - 

k F = 0.52936, C = 1.00589, forward dominance = 3.5692 dB, gain = 0 dB 
Cp = 0.41221 W + 0.36631 X, M F = 0.4081OW ♦ 0.36266X, S F = 0.52936 Y, 
M B = 0.40697 W - 0.39951 X, S B = 0.47064 Y. 
High frequencies rfp = e% - = 45° 

k F = 0.55707, C = 0.89149, forward dominance = 3.5692 dB, gain = -0.7857 dB 
Cp = 0.46527 W + 0.41346X, Mp = 0.49415 W ♦ 0.24746X, S F = 0. 50889 Y, 
M B = 0.55584 W - 0.39304 X, S B = 0.40463 Y. 
psychoacoust ic analysis: 

low frequencies ^9* frequencies 

9 ©V=©E r v r E dB r v r E dB 

0 0.00 1.0000 0.7771 4.169 0.8194 0.8349 '.027 
15 12.24 1.0000 0.7645 4.213 0.8165 0.8226 4.083 



45 



37^28 l'oOOO 0^6869 4^68 0.7937 0.7472 4.426 

50.36 1.0000 0.6*32 4.579 0.7749 0.7046 4.612 

90 78.31 1.0000 0.5860 4.535 0.7273 0.6457 4.791 

135 126.08 1.0000 0.6113 3.637 0.6543 0.6435 4.426 

. . «~ ™ , ^aa a cam -> ft^Q 0.6219 0.6742 4.027 



180 180.00 1.0000 0.6810 2.839 0.6219 0.6742 
Table 2e 5-speaker Ambisonic decoder design for rfp = tfs ' ^ ~ A5 °- 

Low frequencies rfp = 45°, = 50 ° 

kp = 0.50527. C = 1.13949, forward dominance = 3.8929 dB, gain = 0 dB 
Cp = 0.42505 K ♦ 0.29374X, Mp = 0.39694W + 0.43438X, Sp = 0.50527Y, 
Mb = 0.41574 W - 0.42146X, S B = 0.45666 Y. 
High frequencies rfp = 45°, f% = 50° 

k F = 0.54094, C = 0.93050, forward dominance = 3.8929 dB, gain = -0.7838 dB 
C F = 0.46771 W ♦ 0.40469X, Mp = 0.48067 W ♦ 0.28334X, S F * 0.49427 Y, 
M B = 0.57135 W - 0.40401 X, S B = 0.38718 Y. 
psychoacoust ic analysis: 

low frequencies nigh frequencies 

9 9y=0£ r v r E dB r v r E dB 

n 0 00 1 0000 0.7494 4.497 0.8158 0.B273 4.208 



45 36^63 K0OO0 0.6996 4.513 0.7918 0.7655 

60 49.54 I. 0000 0.6705 4.489 0.7740 0.7313 4.536 

95 77.27 1.0000 0.6177 4.309 0.7285 0.67 4 4.640 

^ ns ?i l 0000 0 5824 3.710 0.6567 0.6325 4.429 

80 180.00 KISS SIS 3. 30 8 0.6 2 40 0.6294 4.208 

"able 2f ?-sp*aker Ambi sonic decoder design for d F = *5°, s*p = ^° 
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bow frequencies <J F = 55°, - 45° 

Kp = 0. '509 3 3, C - 0.97862, forward dominance - 2.2870 dB, gain * 0 dB 
C F = 0.39401 W ♦ 0.46686X, M F = 0.39742 W + 0.29732X, S F = 0.^0933 Y, 
M B = 0.41425 W - 0.43739X, S B = 0.40997 Y . 
High frequencies rf F = 55°, 6% - 45° 

kp = 0.53280, C = 0.89997, forward dominance = 2.2870 dB, gain = -1.0511 dB 
Cp = 0.46072 W ♦ 0.46569X, M F = 0.47680W + 0.21890X, Sp = 0.47207 Y, 
M B = 0.54710 W - 0.38686X, S B - 0.33915 Y. 
psychoacoustic analysis: 

low frequencies high frequencies 

q 9v=©£ r v r E dB r v r E dB 

0 0.00 1.0000 0.7185 4.036 0.7509 0.7882 3.961 
IS n.17 1.0000 0.7146 4.044 0.7497 0.7834 3.976 
45 39.91 1.0000 0.6889 4.087 0.7402 0.7507 4.071 
60 53.69 1.0000 0.6724 4.095 0.7324 0.7286 4.125 
90 82.48 1.0000 0.6463 4.071 0.7125 0.6881 4.178 
1^5 129.42 1.0000 0.6*12 3.675 0.6819 0.6564 4.071 
180 180.00 1.0000 0.6516 3.435 0.6682 0.6515 3.961 

Table 2q 5-speatcer Ambisonic decoder design for rfp = 55°, «fe = 45° 
Low frequencies ~ = & = 55° 

Kp = 0.47329, C = 1.01391, forward dominance = 3.0674 dB, gain = 0 dB 
Cp = 0.39903W + 0.41484X, M F = 0.38886W + 0.40426X, S F = 0.47379Y, 
M B = 0.42726 W - 0.48618X, S B = 0.38992 Y. 
High frequencies ^ F = = ^ ~ 

= 0.51350, C = 0.94696, forward dominance = 3.0674 dB, gain - -1.0292 dB 
C F * 0.46402 W + 0.48241 X, M F = 0.45136 W + 0.28083 X, Sp = 0.45613 Y# 
Mg = 0.58098 W - 0-41082 X, S B = 0.31063 Y. 
psychoacoustic analysis: 

low frequencies high frequencies 

G 9v=© E r v r E dB r v r E dB 

0 0.00 1.0000 0.6613 4.702 0.7455 0.7770 4.399 
IS 12.59 1.0000 0.6658 4.649 0.7445 0.7775 4.374 
45 38.29 1.0000 0.6954 4.781 0.7369 0.7763 4.208 
60 51.64 1.0000 0.711? 4.035 0.7304 0.7684 4.109 
90 79.94 1.0000 0.7080 3.675 0.7135 0.7227 4.007 
n5 127.40 1.0000 0.5943 3.841 0.6857 0.6040 4.208 
180 180.00 1.0000 0.5225 4.129 0.6725 0.5439 4.399 

Table 2h 5-speaker Ambisonic decoder design for 0 F = <* B = 6 - 55° 



WO 93/18630 



PCT/GB93/00429 



l£BQ££U££ltt *V ■ 55°, fa • 60° 

*p • 0.45806, C ■ 1 .13695, forvsrd doalnancs • 3.6033 dB, gslo • 0 dB 
Cp ■ 0.41789 W ♦ 0.36491 X, Kf • 0.37845* ♦ 0.48674X, Sf ■ 0.45806 Y, 

■ 0,43420V - 0.S2147X, % « 0.38321 Y. 
High freauenHej rfp • 55°, *b • 60° 

*p » 0.50892, C » 0.99169, forward doodnancs » 3*6033 dB, gain « -0.9719 dB 

Cp ■ 0.47960 W ♦ 0.49722 X, Mp » 0.43446 W ♦ 0.32011 X, Sp • 0.45504 Y, 

Mfc » 0.60440 W - 0.42738 X, « 0.29965 1. 

psychoa ecus tic analysis! 

lov frequencies high frequencies 

Q °V=^E ry rg dB ry rj dB 

0 0.00 1.0000 0.6259 5.227 0.7441 0.7742 4.732 
15 12.21 1.0000 0.6337 5.142 0.7433 0.7766 4.638 
45 37.21 1.0000 0.6890 4.531 0.7363 0.7872 4.392 

60 50.27 1.0000 0.7231 4.116 0.7303 0.7843 4.212 

90 78.20 1.0000 O.73O0 3.550 0.7144 0.7295 4.024 

135 125.99 1.0000 0.5327 4.093 0.6870 0.5581 4.392 

180 180.00 1.0000 0.4251 4.694 0.6736 0.4745 4.732 

T*bl« ?l 5-speaker Arabi sonic decoder design f or fa • 55°, fa « 600 

Low frequencies * 450, fa * 15° 
*C - 0.08476, C « 0.53306, *c « 0 

Cp « 0. 22881 (HOC), Sfc « 0, Mp - 0.22932{W*X), Sp • 0.54238 Y 
>% « 0.54187 V - 0.458125X, % ■ 0.45762 Y. 
Hlqfh frequencies 4 « 450, fa - 15° 
te • 0.2, C » 1.04567, *c « 0 

Cp • 0.275220WC), Sc « 0, Mp - 0.48161 W ♦ 0.01765X, Sp - 0.60000 Y# 

1% - 0.85756 W - 0.60639X, Sb « 0.40000 Y. 

psychoacoustic analysis 

low frequencies high frequencies 

©=Oy=©E r v rp dB ry rg dB 

0 1.0000 0.8084 0.954 0.8540 0.8708 1.449 
15 1.0000 0.7730 1.219 0.8431 0.8400 1.763 

45 1.0000 0.6171 2.697 0.7687 0.7047 3.562 

60 1.0000 0.5629 3.458 0.7180 0.6551 4.560 

90 1.0000 0.5297 4.489 0.6194 0.6068 6.197 

135 1.0000 0.6087 4.787 0.5187 0.6003 7.602 

180 1. 0000 0.6877 4.574 0.4860 0.6070 8.012 

Table 3a example of 6-speaker Ambi sonic decoder design with 4 - 45°, 
fa = 15° and * 0, without forward dominance or gain adjustments. 
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Lov frequencies i = 45°, 4q - 15° 

k = 0.08476, C = 0.53306, forward dominance = 6.5621 dB, gain = 0 dB 
Cp = 0. 37049 W ♦ 0.30791 X, S c - 0, Mp = 0.37131 W * 0. 30859 X, 
S F = 0.54238 Y, Mb = 0.33O40W - 0.34300 X, S B = 0.45762 Y. 
High frequencies 6 - 45°, 4q = l5 ° 

k = 0.20000. C =1.04567, forward dominance = 6.5621 dB, gain - -0.8300dB 
C F = 0.40502W* 0.33661 X, S c = 0, Mp = 0.4781OW + 0.13692X, 
Sp = 0.54532 Y, Mb - 0.53421 W - 0.37775 X, S B = 0.36355 Y. 

Table 3b 6-speaker ambisonic decoder design of table 3a with forward 
dominance and high-frequency gain adjustments. 
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CLAIMS 

1. A decoder (2) for decoding directionally encoded audio 
signals for reproduction via a loudspeaker layout (4) over 
5 a listening area, comprising: 

an input (21) for receiving the directionally encoded 

audio signals; 

matrix means (22,23) for modifying said audio signals; 

and 

10 an output (24) for outputting the modified audio 

signal in a form suitable for reproduction via the 
loudspeakers; 

the coefficients of said matrix means being such that 
at a predetermined listening position in the listening area 

15 the reproduced velocity vector direction and the reproduced 
energy vector directions are substantially equal to each 
other and substantially independent of frequency in a broad 
audio frequency range, 

characterised in that the gain coefficients of said 

20 matrix means (22,23) are such that the reproduced velocity 
vector magnitude r v of a decoded audio signal varies 
systematically with encoded sound direction at frequencies 
in the region of and above a., predetermined middle audio 
frequency . 

25 2. A decoder according to claim 1, in which the matrix 
means comprise: 

first matrix means (22) operative at low audio 
frequencies below a cross-over frequency; 

second matrix means (23) operative at high audio 
30 frequencies above the cross-over frequency, the second 
matrix means being different in effect to the first matrix 
means ; and 

cross-over means (25) for effecting the transition 
around said cross-over frequency between the first matrix 
35 means and the second matrix means; 

the broad frequency range in which the reproduced 

velocity vector direction and the reproduced energy vector 
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direction are substantially equal to each other and 
substantially independent of frequency encompassing said 
cross-over frequency and preferably covering several 
octaves ; and 

the reproduced velocity vector magnitude r ¥ varying 
systematically with encoded sound direction at frequencies 
in the region of and above the cross-over frequency. 
3. A decoder according to claim 1 or 2, wherein above the 
predetermined middle audio frequency r v is significantly 
larger for a frontal encoded direction than for a 
diametrically opposed rear encoded direction. 
4 A decoder according to claim 2 or 3, wherein the 
cross-over frequency lies between 150 Hz and 1 kHz and 
preferably between 200 Hz and 800 Hz. 
15 5. A decoder according to claim 2, 3 or 4, wherein for 
all encoded sound directions the reproduced velocity vector 
magnitude r v is significantly larger below said cross-over 
frequency than above said cross-over frequency. 
6. A decoder according to claim 2, 3, 4 or 5, wherein at 
frequencies below a cross-over transition region around 
said cross-over frequency, the reproduced velocity vector 
magnitude r v is substantially independent of encoded sound 
direction. 

7 A decoder according to claim 6, wherein at frequencies 
below said cross-over transition region, the reproduced 
velocity vector magnitude r v substantially equals 1 for all 
encoded sound directions. 

8 A decoder according to any preceding claim, wherein at 
frequencies above said predetermined middle audio 
frequency, the reproduced energy vector magnitude r E varies 
as a function of encoded sound direction in a broadly 
similar manner to the reproduced velocity vector magnitude 



r v . 
9. 



J a decoder according to any preceding claim, wherein 
control means are provided for adjusting the gain 
coefficients of said matrix means to adapt the decoder for 
a plurality of loudspeaker layout arrangements, said 
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control means modifying the gain coefficient so as to 
change components, including pressure components, of the 
reproduced audio signal. 

10. A decoder according to any one of claims 1 to 8, 
5 wherein said matrix means are arranged to decode the signal 
for reproduction over a loudspeaker layout having a greater 
number of reproduction loudspeaker across a frontal stage 
of directions than across a diametrically opposed rear 
stage of directions, the gain coefficients of the matrix 

10 means being such that at substantially all frequencies the 
reproduced energy vector magnitude r E of sounds encoded to 
be reproduced from vector directions within said frontal 
stage is significantly greater than the reproduced energy 
vector magnitude r £ of sounds encoded to be reproduced from 

15 diametrically opposed vector directions within said rear 

stage . ,. 

11 A decoder according to claim 10. wherein said 
loudspeaker layout is substantially left/right symmetrical 
about a forward axis or plane through the predetermined 

20 listening position. 

12 A decoder according to claim 11, wherein said 
loudspeaker layout comprises three loudspeakers disposed 
across said frontal stage and two loudspeakers disposed 
across a rear stage. 

25 13. A decoder according to claim 11, wherein said 
loudspeaker layout comprises four loudspeakers disposed 
across said frontal stage and two loudspeakers disposed 
across a rear stage. 

14 A decoder according to any preceding claim, in which 
30 the directionally encoded audio signals incorporate sound 

signal components representative of sound pressure and 
orthogonal directional sound velocity components. 

15 A decoder according to any preceding claim, wherein 
said matrix means are arranged to decode directionally 

35 encoded audio signals comprising at least three linearly 
independent combinations of an omnidirectional signal W 
with uniform gain for all directions, and at least two 
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directional signals X and Y, pointing in orthogonal 
directions, representing sounds encoded with figure-of- 
eight or cosine directional gain characteristics. 
16. A decoder according to claim 15, wherein the 
5 reproduced pressure signal at the predetermined listening 
position is at all frequencies a linear combination a^W+byX 
of W and X whose relative proportions a^by vary with 
frequency, the reproduced forward-pointing velocity signal 
at the predetermined listening position is at all 

10 frequencies a linear combination a x W+b x X of W and X whose 
relative proportions a^bj do not vary with frequency, and 
the reproduced sideways -pointing velocity signal at the 
predetermined listening position is at all frequencies 
proportional to Y. 

15 17. A decoder according to any one of claims 1 to 13, 
wherein said matrix means are arranged to decode 
directionally encoded audio signals comprising two 
independent complex linear combinations of an 
omnidirectional signal W with uniform gain for all 

20 directions, and at least two directional signals X and Y, 
pointing in orthogonal directions, representing sounds 
encoded with figure-of-eight or cosine directional gain 
characteristics . 

18. A decoder according to claim 17, wherein the gain 
25 coefficient of the matrix means are such that the 
reproduced pressure signal at the predetermined listening 
position is at all frequencies a linear combination 
ayW+byX+jCyY of W, X and Y, whose relative proportions ayifay 
vary with frequency, and the reproduced signal representing 
30 forward-pointing velocity at the predetermined listening 
position is at all frequencies a linear combination 
axW+l^X+jCxY of W, X and Y, and the reproduced signal 
representing sideways-pointing velocity at the 
predetermined listening position is at all frequencies a 
35 linear combination -ja y W-jb Y X+c^Y of W, X and Y, where the 
coefficients a u , b u , c u , a x , b^ c x , a Y , b y and c Y are real 
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and where j = ✓<-« represents a broadband relative 90- 
phase difference. 

„ A decoder according to claim 17 or 18, wherein the 
5 marrix Beans further comprise phase-amplitude matrix means 
arranged to produce at least three complex linear 

. . ■ v v and preferably or optionally a 

combinations W 2 , X 2 , Y 2 , ana preier i 

fourth linear combination B 2 , of two directionally encoded 
input signals such that W 2 and X 2 have directional gain of 

10 the form a 2+ b 2 X + c 2j Y for real gains a 2 , b 2 and c 2 that may be 
different for W 2 and X 2 and where V 2 and B 2 are respectively 
proportional to jX 2 and to jW 2 or to real linear 
combinations thereof, and wherein said signals are fed by 
cross-over means with matched phase responses to at least 

15 two amplitude matrix means corresponding to 

frequency ranges in the audio band to provide modified 
audio signals at the output of the decoder. 

20 A decoder according to any one of claims 15 to 19, 
. wherein the matrix means further comprise additional linear 
20 matrix means arranged to apply an additional linear 

transformation so that the output reproduced vector 
directions are related to the input encoded vector 
directions according to a transformation of direction. 

21 A decoder according to claim 20, wherein said 
25 transformation of directions is a Lorentz transformation. 

22 A decoder according to claim 20 or 21, wherein the 
effect of said additional matrix transformation is to 
render the total reproduced energy gains of sounds encoded 
at the front and at the rear substantially equal. . 

30 23. A decoder according to claim 20, 21 or 22, wherein 
said additional linear matrix transformation is 
as a linear matrix acting on said directionally encoded 
signals or linear combinations thereof. 

L A decoder according to claim 20, 21 or 22, wherein 
said additional linear matrix transformation is combined 
with said matrix means or said first and second matrix 
means . 
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25. A decoder according to claim 15 or 16, having at least 
three loudspeakers across a reproduced frontal stage, 
wherein said directionally encoded audio signals 
additionally comprise signals proportional to E and/or F, 

5 where 

E has a directional gain characteristic substantially 
equal to zero outside an encoded frontal stage of encoded 
directions and a gain proportional to a linear combination 
of W and X having a positive gain for sounds at the centre 

10 of the encoded frontal stage across a frontal stage of 
encoded directions; and 

F has a gain substantially proportional to that of Y 
across a frontal stage of encoded directions and a gain 
substantially proportional to that of -Y across a rear 

15 stage of encoded directions, 

and where the decoder incorporates means for adding E to 
and subtracting E from the signal components containing W 
and X so as to localize encoded frontal stage sounds more 
precisely in individual frontal stage loudspeakers and/or 

20 means for adding F to and subtracting F from signal 
components containing Y so as to reduce cross talk between 
reproduced front and rear sound stages. 

26. A decoder according to claim 24, wherein E has a gain 
of opposite polarity for sounds at the edges of the encoded 

25 frontal stage than for sounds encoded towards the centre of 
the encoded frontal stage. 

27. An audio system comprising: 
a decoder; 

a multiplicity of loudspeakers laid out around a 
30 listening area; and 

an amplifier for amplifying the output of the decoder 
to drive the loudspeakers; 

the decoder decoding directionally encoded audio 
signals for reproduction via the loudspeaker layout over 
35 the listening area, the decoder comprising: 

an input for receiving the directionally encoded audio 
signals; 
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matrix means for modifying said audio signals; and 
an output for outputting the modified audio signal in 
a form suitable for reproduction via the loudspeakers; 

the coefficients of said matrix means being such that 
5 at a predetermined listening position in the listening area 
the reproduced velocity vector direction and the reproduced 
energy vector directions are substantially equal to each 
other and substantially independent of frequency in a broad 
audio frequency range, 
10 characterised in that the gain coefficients of said 

matrix means are such that the reproduced velocity vector 
magnitude r v of a decoded audio signal varies substantially 
with encoded sound direction at frequencies in the region 
of and above a predetermined middle audio frequency. 
15 28- A system according to claim 27, in which the 
loudspeaker layout includes a greater number of 
reproduction loudspeakers across a frontal stage of 
directions and a lesser number of loudspeakers across a 
diametrically opposed rear stage of directions, and in 
20 which the gain coefficient of the matrix means of the 
decoder are such that at substantially all frequencies the 
reproduced energy vector magnitude r E of sounds encoded to 
be reproduced from vector directions within said frontal 
stage is significantly greater than the reproduced energy 
25 vector magnitude r E of sounds encoded to be reproduced from 
diametrically opposed vector directions within said rear 

29 a9e A system according to claim 28, in which the 
loudspeaker layout is substantially left/right symmetrical 
30 about a forward axis or plane through the predetermined 
listening position. 

30. A system according to claim 29, wherein said 
loudspeaker layout comprises three loudspeakers disposed 
across said frontal stage and two loudspeakers disposed 

35 across a rear stage. 

31. A system according to claim 29, wherein said 
loudspeaker layout comprises four loudspeakers disposed 
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across said frontal stage and two loudspeakers disposed 
across a rear stage. 

32. An audio-visual system incorporating in its audio 
stages a decoder according to any one of claims 1 to 26, or 

5 an audio system according to any one of claims 27 to 31. 

33. A method of decoding directionally encoded audio 
signals for reproduction via a loudspeaker layout over a 
listening area, comprising applying the encoded audio 
signal to matrix means arranged to decode the signal, and 

10 outputting the signal in a form suitable for 

subsequent reproduction via the loudspeakers, 

the coefficient of said matrix means being such that 

at a predetermined listening position in the listening area 

the reproduced velocity vector direction and the reproduced 
15 energy vector direction are substantially equal to each 

other and substantially independent of frequency in a broad 

audio frequency range, 

characterised in that the reproduced velocity vector 

magnitude r v of a decoded audio signal varies 
20 systematically with encoded sound direction at frequencies 

in the region of and above a predetermined middle audio 

frequency. 

34. A method according to 33, in which low audio 
frequencies of the encoded audio signal below a 

25 predetermined cross -over frequency are decoded by first 
matrix means, and high audio frequencies above the cross- 
over frequency are decoded by second matrix means different 
in effect to the first matrix means, the broad audio 
frequency range in which the reproduced velocity vector 

30 direction and the reproduced energy vector direction are 
substantially equal to each other and substantially 
independent of frequency encompassing the cross-over 
frequency; and 

the reproduced velocity vector magnitude r v varying 

35 substantially with encoded sound direction at frequencies 
in the region of and above the cross-over frequency. 
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35. A method of encoding and decoding an audio signal, in 
which the audio signal is encoded as at least three 
linearly independent combinations of an omnidirectional 
signal W with uniform gain for all directions and two 
5 directional signals X and Y pointing in orthogonal 
directions, the signals X and Y having figure-of-eight or 
cosinusoidal directional gain characteristics, and the 
signal is subsequently decoded by a method according to 
claim 33 or 34, 

10 36. A method of encoding and decoding an audio signal 
according to claim 35, wherein the reproduced pressure 
signal at the predetermined listening position is at all 
frequencies a linear combination ayW+byX of W and X whose 
relative proportions a u :by vary with frequency, the 

15 reproduced forward -pointing velocity signal at the 
predetermined listening position is at all frequencies a 
linear combination a x W+b x X of W and X whose relative 
proportions a x :b x do not vary with frequency, and the 
reproduced sideways-pointing velocity signal at the 

20 predetermined listening position is at all frequencies 
proportional to Y. 

37. A method of encoding and decoding an audio signal, in 
which the audio signal is encoded as two independent 
complex linear combinations of an omnidirectional signal W 

25 with uniform gain for all directions, and at least two 
directional signals X and Y, pointing in orthogonal 
directions representing sounds encoded with figure-of-eight 
or cosine directional gain characteristics, and the signal 
is subsequently decoded by a method according to claim 33 

30 or 34. 

38. A method of encoding and decoding according to claim 
37, wherein the reproduced pressure signal at the 
predetermined listening position is at all frequencies a 
linear combination ^W+b^+jCyY of W, X and Y, whose 

35 relative proportions a u :b u vary with frequency, and the 
reproduced signal representing forward-pointing velocity at 
the predetermined listening position is at all frequencies 
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a linear combination a^b^c,,* of W, X and Y, and the 
reproduced signal representing sideways-pointing velocity 
at the predetermined listening position is at all 
frequencies a linear combination -■5<x r W-3b r X+c Y Y of W, X and 
5 Y , where the coefficients a w , by, Cy, a x , b x , c*, a Y , b Y and 
c Y are real and may be frequency-dependent and where j = 
-/(-i) represents a broadband relative 90° phase difference. 
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